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(57) Abstract 

The invention relates to new expression systems and in particular to an expression system in which a gene of interest is expressed 
at an optimal level. The invention provides a recombinant expression vector comprising a gene of interest and a selectable marker gene, 
wherein the selectable marker gene is arranged downstream of the gene of interest and a stop codon associated with the gene of interest is 
spaced from a start codon of said selectable maiicer gene at a distance which is sufficient to ensure that translation reinitiation is required 
before said selectable marker protein is expressed from the corresponding MRNA. Examples of such expression systems are vector viral 
packaging cell lines and a number of preferred cell lines have been identified. 
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Expression systems 
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The present invention relates to new expressions systems, 
and in particular to expression systems in which a gene of 
interest is expressed at an optimal level. Particular 
examples of such expression systems are retroviral packaging 
cell lines and a number of preferred cell lines have been 
identified. 



The ability of eukaryotic and prokaryotic ribosomes to 
reinitiate translation at an internal start codon within an 
mRNA sequence has previously been recognised. Studies have 
been reported in which the efficiency of the process, which 

15 is generally regarded as being low, has been connected with 

the length of the intercistronic sequence (Kozak (1987) 
Mol. Cell Biol. 7, 343 8-3445) . Selection of this sequence 

or spacer as 70bp in length, and containing no other start 
codons, has been previously reported as being optimal for 

20 reinitiation in a eukaryotic cell line (Cosset F-L. , 

Virology (1991) 185, 862) . 

The applicants have found a way in which the inefficiency 
associated with the translation reinitiation process can be 
25 used to good effect. 

According to the present invention there is provided a 
recombinant expression vector comprising a gene of interest 
and a selectable marker gene, wherein the selectable marker 

30 gene is arranged downstream of the gene of interest and a 

stop codon associated with the gene of interest is spaced 
from a start codon of said selectable marker gene at a 
distance which is sufficient to ensure that translation re- 
initiation is required before said selectable marker protein 

35 is expressed from the corresponding mRNA. 



SUBSTITUTE SHEET (RULE 26) 
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The invention further provides a process for producing cell 
lines in which a gene of interest is expressed, which 
process comprises transforming host cells with an expression 
vector comprising said gene of interest and a selectable 
marker gene, wherein the selectable marker gene is arranged 
downstream of the gene of interest and a stop codon 
associated with the gene of interest is spaced from a start 
codon of said selectable marker gene at a distance which is 
sufficient to ensure that translation re-initiation is 
required before said selectable marker protein is expressed 
from the corresponding mRNA, and selecting those cells where 
expression of the selectable marker gene may be detected. 

Since re- initiation of translation is a relatively 
inefficient process, this means that the selectable' marker 
protein will be expressed at lower levels than the product 
of the gene of interest. When the marker protein is 
expressed at detectable levels, the gene of interest will be 
expressed at higher levels. This will ensure that during 
the subsequent selection procedure , only those cell clones 
which express the gene of interest at higher or optimal 
levels will survive. Lo'w expressing clones will be 
eliminated by the selection process. 

25 Cells transformed with the above -described expression 

vectors form a further aspect of the invention. 

The host cells are suitably eukaryotic or prokaryotic host • 
cells, preferably eukairyotic host cells. 

30 

The number of nucleotides in the space between the stop ' 
codon of the gene of int.erest and the start codon of the 
selectable marker will suitably be in the range of from 20- - 
200 nucleotides, preferably from 60-80 nucleotides, even 
35 more preferably 70-80 nucleotides. 



20 
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The vectors used in the process of the invention may be any 
of the known types, for example expression plasmids or viral 
vectors . 

5 Selected cells may be cultured and if required, the protein 

product of the gene of interest isolated from the culture 
using conventional techniques. Alternatively, expression of 
the gene of interest may result in other desired effects, 
for example, where the gene of interest is included as part 
10 of a viral packaging construct. 

Some experimental and clinical gene transfer protocols 
require the design of gene transfer vectors suitable for in 
vivo gene delivery (Miller, A.D. 1992. Nature 357:455- 

^5 4 60) . Retroviral vectors are attractive candidates for 

such applications, because they can provide stable gene 
transfer and expression (Samarut J. et al.,Meth. Enzymol . in 
press) and because packaging cells have been designed which 
produce non-replication competent viruses (Miller A.D (1990) 

20 Hum Gene Ther . 1 5-14) . However currently available 

recombinant retroviruses suffer from a number of drawbacks. 

Packaging cell lines provide in trans the retroviral 
proteins encoded by the gag , pol, and env genes required to 

25 obtain infectious retroviral particles. The gag and pol 

products are respectively the structural components of the 
virion cores and the replication machinery (enzymes) of the 
retroviral particles whereas the env products are envelope 
proteins responsible for the host -range of the virions and 

30 for the initiation of infection and for sensitivity to 

humoral factors. An ideal packaging cell line should 
produce retroviruses that only contain the retroviral vector 
genome, and absolutely no replication-competent genomes or 
defective genomes encoding some of the viral structural 

35 genes. 
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A number of packaging cell lines designed for human gene 
transfer have been designed in the past by introducing 
plasmid DNAs which contain "helper genomes" encoding gag, 
pol and/or env genes into cells. 

Recroviral packaging cell lines are cells that have been- 
engineered to provide in trans all the functions required to 
express infectious retroviral vectors. A helper genome (or 
construct or unit), is herein also referred to as 
"retroviral packaging construct (or unit)" or "packaging- 
deficient construct (or genome unit)" or "gag-pol/env 
expression plasmids" . 

Much efforts has been made to design strategies to optimize 
the helper-genomes in order (i) to get the highest 
production of retroviral packaging functions (which 
correlates which infection titers of retroviral particles) 
and (ii) to minimise the chance that the helper genome can 
be transmitted via the viral particles (which may lead to 
emergence of unwanted retroviral forms) . 

The first of these packaging cell lines used full length 
retroviral genomes as helper genomes that had been crippled 
for important cis-regulated replicative functions (reviewed 
in Miller, Hum. Gene. Ther. 1:5-14 1990). In order to reduce 
the possibility of occurrence of replication-competent 
viruses and of transfer of virus structural genes, a second 
generation of safer packaging cell lines has been designed-, 
by using two separate and complementary helper genomes which' 
express either gag-pol or env and are packaging-deficient 
(Miller supra) . ; , 

The cells into which these helper genomes were introduced 
were isolated by cotransf acting them with plasmids encodihg 
selectable markers. However, as no selection was applied on 
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the packaging-deficient retroviral genome itself, the" helper 
functions can be lost during the passages of the cells in 
culture and the current packaging systems provide limited 
titers of infectious retroviral vectors, usually only of the 
order of 10^-10^ infectious units i.u/ml. Indeed the 
cotransfection with a plasmid encoding a selectable marker 
does not directly select the best gag-pol-env-expressing 
cells • 



The invention further provides a retroviral packaging cell 
line comprising a host cell transformed with (i) a packaging 
deficient construct which expresses a viral gag-pol gene and 
a first selectable marker gene, and/or (ii) a packaging- 
deficient construct which expresses a viral env gene and a 
second selectable marker gene; wherein a start codon of the 
first and second selectable markers are spaced from the stop 
codons of the viral gag-pol gene and the viral env gene 
respectively by a distance which ensures that reinitiation 
of mRNA translation is required for expression of marker 
protein product of said first and/or second selectable 
marker gene. 

The retroviral packaging cell line may be obtained by the 
above described process which will involve selecting 
transfected cells which express said first and/or second 
marker genes. 

By using helper constructs which are directly selectable and 
which provide for high expression of the viral gene, high 
titre retroviral vectors may be obtained. 

Helper constructs for use in the process form a furtner 
aspect of the invention. 

The retroviral vectors prepared from the conventional 



9708330A1_L> 



wo 97/08330 PCT/GB96/02661 



packaging cell lines are usually not contaminated by " 
replication-competent retrovimses (RCRs) . However, 
recombinant amphotropic murine retroviruses have been shown 
to arise spontaneously from certain packaging cell lines. 
5 The generation of such RCRs involves recombination at least 

between gag-pol/env packaging sequence and vector sequences 
(Cosset et al.. Virology, (1993) 193:385-395). ' 

Recombinant RCRs have been associated with the development 
10 of lymphomas in some severely immunosuppressed monkeys 

(Donahue et al . , J. Exp Med (1992) 176: 1125-1135). In 
addition, retroviral vector preparations may also contain', 
at low frequencies, retroviruses coding for functional 
envelope glycoproteins (Kozak and Kabat, 1990, J. Virol. 64: 
15 3500-3508) or for gag-pol proteins. Although the 

pathogenicity of these gag-pol or env recombinant 
retroviruses is probably low, more evolved recombinant 
retroviruses with higher pathogenic potential may occur when 
injected in vivo , by recombination and/or complementation ^ of 
20 the initial recombinant viruses with some endogenous 

retroviruses . 

In a preferred embodiment of the retroviral packaging cell 
lines of the invention, the overlapping sequences between.. 
25 the. genomes of the retroviral vector and the helper 

construct are reduced, for -example as compared to constructs 
such as CRIPenv and CRIPAMgag (Danos et al . , Proc . Natl. 
Acad. Sci USA 85: 6460-6464). In particular, the viral 
sequences in the helper construct are reduced, for example, " 
not only the packaging sequence but also the 3' Long - ' 
Terminal Repeat (LTR) , the 3' non-coding sequence and/or the 
5'LTR may be eliminated. 



30 



35 



The possibility of generation of such RCRs and recombinant' 
retroviruses can be reduced by reducing the overlapping ' 
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sequences between the genomes of both the retroviral vector 
and the helper construct . 

Conventional retroviral vectors are strongly inactivated by 
5 human serum which makes them of limited or no use for in 

situ gene transfer in gene therapy applications. It has 
previously been shown that inactivation by complement in 
human serum is controlled by the cell line used to produce 
the virions and by viral envelope determinants (Takeuchi et 

10 al., J. Virol (1994) 68:8001-8007), In particular, 

inactivation is caused by some properties of the cell lines 
that have been used to construct the packaging cells (NIH- 
3T3) and also by viral determinants located in the 
retroviral envelope as shown (Takeuchi et al . , J. Virol 

15 (1994) 68:8001-8007) , In vivo gene delivery is an important 

goal for a number of human gene therapy strategies . 

The applicants have found that certain cell lines form 
preferred packaging cell lines. 

20 

Particularly preferred packaging cell lines are the HT1080 
line, the TE671 line, the 3T3 line, the 2 93 line and the Mv- 
1-Lu line. One example of retroviral packaging cells that 
will -produce complement-resistant virus comprise human 
25 HT1080 cells and express RD114 envelope. Such cells form a 

preferred aspect of the invention. 

Packaging cell lines according to the invention provide 5 0- 
100 fold increased titers of retroviral vectors as compared 
to conventional packaging cell lines. Retroviral vectors 
provided by these new cells are safe, in terms of generation 
of RCRs, and considerably more resistant to inactivation by 
human complement . 

3 5 Packaging cell lines according to the invention may be able 
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to transduce helper-free, human complement -resistant " 
retroviral vectors at titers consistently higher than lo^ 
1 . u , /ml . 

suitable semi -packaging cell lines in accordance with the 
xnvention are those which express only the gag-pol genes 
Such cell lines may suitably be derived from TE671, MINK Mv- 
1-Lu, HT1080, 293 or NIH-3T3 cells by introduction of 
plasmid CeB (the MoMLV gag-pol expression unit). 

Particularly preferred expression vectors in accordance with 
. the invention for use in retroviral packaging cell line. • are 
-those whxch include MLV gag and pol genes such as CeB 
Other plasmids may include gag and pol genes from other 
15 retroviruses or chimeric or mutated gag and pol genes. 

various viral and retroviral envelope genes may be included 
in the plasmids such as MLV-A envelope, GALV envelope, VSV-G 
protein, BaEV envelope, RD114 envelope and chimeric or 
20 mutated envelopes. Plasmids which include the RD114 env 

gene such as FBdelPRDSAF as illustrated hereinafter, provide 
one example of suitable constructs . 

The novel retroviral packaging cells described hereinafter' 
have been designated FLY cells, and may be designed for in' 
vivo gene delivery. 



25 



considerable variations were found between the various e^i 

30 retroviruses. In addition, few cell lines were able to" • 

produce retroviruses completely resistant to human ' • 

complement. Based on these two criteria, human fibrosarcoma 
HT1080 and rhabdomyosarcoma TE671 cells were selected ' f dir> 
optimum construction of packaging cells. - W 



BNSDOCID: <WO_9708330A1„I_> 



wo 97/08330 v||V PCT/GB96/02061 



Other studies have shown the importance of endogenous" 
retrovirus expression in the generation of recombinant 
retroviruses from retroviral packaging lines (Ronfort et 
al.. Virology, (1995), 207, 271-275, Vanin, E.F.etal., J 
5 Virol (1994) 68:4241-4250,). The co-packaging of an 

endogenous genome and a vector can lead to emergence of 
recombinant retroviruses (Vanin et al., supra). 
Recombination involves template switching during reverse 
transcription of such hybrid retroviruses (Hu et al,, 

10 Science, (1990) 250:1227) and homologies between the two 

genomes considerably enhance the frequency of reverse 
transcriptase jumps (Zhang et al . , J. Virol. (1994) 68: 
2409-2414). Therefore an ideal packaging cell line should 
not express endogenous MLV-like (or type C retrovirus - like) 

15 retroviral genomes which can be packaged by type C gag 

proteins (Scadden et al . , J. Virol. (1990) 64: 424-427, 
Torrent et al . , J. Mol . Biol. (1994) 240 434-444). 



20 



Packaging of human endogenous retroviral RNA was not 
detected in TELCeB and FLY packaging cells when virion 
associated RNA was analysed by RT-PCR using 'generic primers. 

HT1080- and TE671 derived packaging cell lines may be safer 
in this respect than those generated from NIH3T3 cells, sueh 
as GP+EAM12 cells, which are known to express and package 
25 sequences related to type C retroviruses (Scadden et al . 

supra) . 

To generate the FLY packaging cell lines, HT1080 cells were 
transfected with gag-pol and env expression plasmids 
designed to optimise viral protein expression. Direct 
selection for viral gene' expression was achieved in 
accordance with the invention by expression of a selectable 
marker gene by re-initiation of translation of the mRNA 
expressing the viral proteins. This strategy resulted in 
3 5 packaging cell lines capable of producing extremely high 
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titer viruses. Furthermore, long-term expression of " 
packaging functions can be maintained in these cells. Many 
unnecessary viral sequences were eliminated from the 
packaging constructs to reduce the risk of helper virus 
generation; indeed the final packaging cells did not produce 
helper virus, in that no replication competent virus (RCR) 
could be detected per 10' vector particles. 

The FLY packaging cells described herein are safer than for 
example, psiCRIP cells, at least for generation of env 
recombinant retroviruses as is illustrated in Table 4 
hereinafter, probably because less retroviral sequences ' 
overlapping with the vector were present in the present env- 
expression plasmid. Few reports have addressed the question 
of the characterization of recombinant retroviruses (RVs) 
(Cosset, F.L., et al. , Virology (1993) 193:385-395) It is 
possible that such RVs could not be detected in previous 
packaging cell lines due to lower overall titers. RVs are 
defective in normal cell culture conditions but are likely 
to evolve to replication competent viruses if they are 
allowed to replicate in cells complementing their expression 
like co-cultivated packaging cells (Bestwick et al . , Proc 
Natl Acad Sci USA, (1988) 85: 5404-5408, Cosset et al 
(1993) supra) . 

In preferred retroviral packaging systems according to the 
invention, RVs are eradicated for example by removal of 
viral LTRs from the packaging construct. 

Consistent with our previous studies (Takeuchi, Y et al 
J Virol (1994) 68:8001-8007), LacZ(RD114) and lacZ '(MLV-A) ' ' 
pseudotypes produced from HT1080 and TE67lcells were more 
resistant to human complement than LacZ(RDll4) or LacZ (MLV- 
A) pseudotypes produced by 3T3 of dog cells. It was 
therefore decided to use RD114 and MLV-A env genes to 
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generate recombinant virions with MoMLV cores. 

The sequence of RD114 env gene was determined and is shown 
in Figure 4. It was found to be very close to BaEV (baboon 
endogenous virus) a type C retrovirus (Benveniste, R.E.et 
al-, Proc. Natl. Acad. Sci. USA (1973) 70:3316-3320; Kate, 
S.et al., Japan, J. Genet. (1987) 62:127-137) with an 
envelope gene displaying similarities to the external part 
of type D simian retroviruses (SRVs) . RD114 uses the SRV 
receptor on human cells (Sommerfelt & Weiss, Virology 
(1990) 176:58-69; Sommerfelt, M.A. et al . , J Virol (1990) 
64:6214-6220) making the FLY packaging cells with RD114 
envelope capable of generating virions with different 
tropism. Retroviral vectors prepared so far for human gene 
therapy have used either MLV~A or GALV (gibbon ape leukemia 
virus) envelopes which display some similarities (Battini, 
J-L.,et al., J Virol. (1992) 66:1468-1475) and which use two 
related cell surface receptors for infection (Miller, D.G. 
et al., J Virol (1994) 68:8270-8276). Differences in tissue- 
specific expression of MLV-A or GALV receptors have been 
reported (Kavanaugh et al., Proc Natl Acad Sci USA (1994) 
91:7071-7075) . 



The invention will now be particularly described by way of 
example with reference to the accompanying drawings in 
which: 

Figure 1 . illustrates the structure and expression of CeB. 
The env gene (Xbal-Clal) of plasmid pCRIP was removed and 
was replaced by coinsertion of the two fragments Xbal-Sfil 
(restriction sites underlined) from pOXEnv and a Sfil-Clal 
PGR product containing the bsr selectable marker. This ' 
results in positioning the bsr start codon (shadowed) 74 bp 
downstream to the pol stop codon (bold) . 
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Open triangle are start codons (303 and bsr) , black " 
triangles are stop codons (^ol and bsr) . The shadowed 
trxangle is the start codon of env, in the same reading ' 

frame with that of bsr. SD and SA a t-^ i • 

T . the splice donnor and 

splice acceptor sites - 

i 

Figure 2 illustrates the structure and expression of 

FbdelPASAF . ■! 

immediately after the stop codon of env (bold) was inserted 
a non retroviral Kasl-Ncol (restriction sites underlined.) 
linker which positions the ^hleo start codon (shadowed) 76 
op downstream. 

open triangle are start codons (env and ^hleo) , black 
triangles are stop codons (env and phleo) . SD and SA are 
the splice donnor and splice acceptor sites. 

Figure 3 illustrates plasmids for expression of A^pho, ECo, 
RD114, Xeno, lOAl, GALV, VSV-G and FeLVB envelopes 
All genes are expressed in the same backbone as detailed in 
fxg. 2. The Bglii sites for ecotropic (MoMLV strain) 
lOAl, xenotropic (NZB.1.V6 strain) and amphotropic (4070A 
strain) , the Ndel site of RD114 (SC3C strain, the BamHl site 
for both FeLVB and GALV were used as 5' ends, and linked to 
Mscl site immediately after the splice donor site in the : 
25 leader of FB29 LTR. 

Figure 4 shows the sequence of the RD114 env gene (SEQ ID No 



Figure 5 shows the genetic structure of gag-pol constructs. 
Initiation W and termination (▼) codons are shown The 
thick dotted line below each construct shows MLV-derived 
sequences. Nucleotide positions of MLV-derived sequences are 
shown according to: Shinnick et al . (1981) (from nt 1 to nt 
6000 with deletion of the packaging signal (DY) from Ball 
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(nt 215) to PstI (nt 568), and with some further MoMLV 
sequences in both CeB and CeB DS- from nt 7676 to nt 7938. 
gag-pol and bsr genes were expressed from the same 
transcription unit using the either a retroviral promoter 
5 (Mo LTR) or a non retroviral promoter (hCMV) and non 

retroviral polyadenylat ion sequence (polyA) . Splice donor 

(SD) and acceptor (SA) sites are indicated. The thin line 
denotes retroviral non coding sequences. The thick line 
shows the rabbit beta-1 globin intron B. The position of 
10 some restriction sites is indicated. 

The nucleic acid sequences of portions of constructs (as 
shown in Figure 5 (boxed areas)) are displayed for CeB (SEQ 
ID No 2, Figure 6) , hCMV+ intron (SEQ ID No 3 , Figure 7) and 
15 hCMV+intronkaSD (SEQ ID No 4, Figure 8) . 



The nucleic acid sequences of portions of constructs (as 
shown in Figure 3 (boxed areas)) are displayed for 
FbdelPASAF (SEQ ID No 5, Figure 9), FbdelPMOSAF (SEQ ID No 
6, Figure 10) , FbdelPGASAF (SEQ ID No 7, Figure 11) , 
FbdelPRDSAF. (SEQ ID No8 , Figure 12) and CMVlOAl (SEQ ID No 
9, Figure 13) are shown. 

The components of the viral particles are produced by two 
independent expression plasmids ( craa - pol or env ) which also 
contain selectable markers (bsr or phleo) expressed from the 
same transcriptional units as gag-pol or env (figs. iSc 2) . 
The selectable markers are located downstream to qag - ool or 
env genes and there is an optimal distance between the stop 
codon of the upstream reading frames and the start codon of 
the selectable genes that should allow re- initiation of 
translation (Kozak, Mol Cell Biol. (1987) 1 , :3438-3445) . 
Because there is no "Kozak" sequence (Kozak, Cell, (1986) 44: 
283-292) required for a normal initiation of translation for 
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the marker gene, they can only be expressed by re-initiation 
of translation after the upstream viral gene has been 
successfully expressed. Consequently and also because re- 
initiation of translation is a poorly efficient process, 
after transfection of these plasmids , cells resistant td the 
drugs corresponding to those selectable genes express hfgh 
levels of the viral proteins. 

To avoid viral transmission of these "helper" genomes the 
constructs used suitably have the classical deletions of 
both the packaging, sequence located in the leader region, and 
of the 3'LTR, the latter being replaced by SV40 
polyadenylation sequences (Figs 1 & 2) . ! 

15 Plasmid CeB is the MoMLV gag-pol -expression unit. it- 

derives from pCRIP, a plasmid used to generate the 
constructs introduced in the CRIP and CRE packaging cell' 
lines (Danos and Mulligan, 1988) . As shown in fig. i f6r 
generation of plasmid CeB the env gene of pCRIP has been' 

20 deleted mostly and the hMJC selectable marker, -encoding a 

protein conferring resistance to blast icidin (Izumi et al., 
Experimental Cell Research (1991) 197, 229-233)- has been 
inserted downstream to eoI gene. There are exactly 74 bp 
with no ATG triplets between the stop codon of E2i and the 

25 start codon of bsr, this allows its expression by re- ' 

initiation of translation on the gag-pol mRNA, after- • ' 
translation of the 333-201 reading frame. 



30 



FbdelPASAF is a plasmid expressing the amphotropic env gene 
and the EhX^ selectable marker conferring resistance to'. 
phleomycin (Gatignol et al . , FEBS Letters (1988) 230:171- 
175) . By using a PCR-mediated mutagenesis strategy which 
modifies the end of env gene (see fig. 2), a 76 bp linker- 
was inserted between the stop codon of env and the start ' 
3 5 codon of Ehleo. This allows expression of phlen from the 
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15 

env mRNA by re-initiation of translation. In addition 
compared to known env -expressing constructs, this strategy 
of construction has reduced the length of sequences 
overlapping with the ends of conventional retroviral 
vectors. The env genes of Mo-MLV, FeLVB, NZB.1V6, lOAl, 
GALV and RD114 are expressed by plasmids FBdelPMoSAF, 
FBdelPBSAF, FBdelXSAF, FBdelpGSAF, FBdelplOAlSALF and 
FBdelPRDSAF, respectively, by using the same backbone as 
FBdelPASAF (fig. 3) . Retroviral vectors produced with the 
RD114 envelope will be useful for in vivo gene delivery as 
comparatively to MLV ecotropic or amphotropic envelopes, 
virions pseudotyped with RD114 envelopes are not inactivated 
by human complement when they are produced by Mink Mv-l-Lu 
cells or by some human cells (Table 1) . 



The HT10 80 cell line, isolated from a human fibrosarcoma 
(ATCC CCL121) . The TE671 cell line isolated from a human 
rhabdomyosarcoma (ATCC CRL 8805) (purchased from ATCC, and 
tested for absence of usual cell culture contaminants by 
ECACC) , has been used for the definitive construction of 
packaging cell lines. HTIOBO line was chosen among a panel 
of primate and human lines because MLV-A and RD114 
efficiently rescued retroviral vectors from these cells and 
also because RD114 pseudotypes produced by this cell line. 
25 were stable when incubated in human serum. In a standard 

assay (Takeuchi et al . , J Virol (1994), G8, 8001-8007), 
these latter viruses were found more than 50 0 fold more 
stable than similar pseudotypes produced in 3T3 cells. 

30 Another advantage for the use of non murine cells to derive 

packaging lines is the absence of MLV-related endogenous 
retroviral -like sequences (like VL30 in 3T3 cells) that can 
cross -package with MLV- derived retroviral vectors (Torrent 
et al., 1994) and generate potentially harmful recombinant 

35 retroviruses. 
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The helper constructs were introduced into other cell" lines 
(HT1080 (table 2) MinkMv-l-Lu (table 2), 3T3 (not shown), 
TE671 (table 2)) for the purpose of comparisons of the ' 
efficiency of the constructs. 

As illustrated hereinafter (Table 2), the reverse 
transcriptase (RT) activity (provided by expression of the 
pel gene) in cells transfected with CeB is significantly 
higher than that of the same cells transfected by the 
parental plasmid pCRIP or that of cells chronically infected 
by MLV. This enhancement of viral gene expression is 
correlated with the titers of lacZ retroviral vectors when 
an envelope is provided in CeB-lacZ cells after comparison 
with titers of lacZ pseudotypes of either replication- 
competent viruses or other helper- free packaging systems. 

For the generation of final packaging cell lines, the best 
clonal env transf ectants have been selected. Packaging 
systems obtained in this way will be able to produce helper- 
free retroviral vectors at titers greater than io« 
infectious particles per ml, which would be 10-100 fold 
higher to helper-free preparations of others. 

Because of the way the selectable markers are expressed (see 
above) , growing the packaging cells in phleomycin and 
blasticidin selective pressure increase and stabilize the 
expression of the retroviral components and particularly ^ the; 
envelopes, as it is -possible that env glycoproteins have ' 
toxic effects for the producer cells in the long term which 
may lead to a decrease of expression. 



Such an enhancement of viral production observed with the 
packaging systems described herein might increase the 
emergence of unwanted retroviruses having recombined between 
35 the genomes of both the retroviral vector and either of the 
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two packaging-deficient constructs. However, the constructs 
have been designed in such a way that it reduces the 
probability of emergence of recombinant viruses compared to 
the parental constructs. To check their safety, attempts 
5 have been made to detect the presence of replication- 

competent retroviruses by a mobilisation assay of a lacZ 
provirus. No RC viruses have been found in all retroviral 
vector preparations tested so far. 

10 The following Examples illustrate the invention. 

Example 1 

Preparation of Cell lines and viruses. 
15 The following cell lines were used: 

A204 (ATCC HTB 82), HeLa (ATCC CCL2 ) , HT1080 (ATCC CCL121), 
MRC5 (ATCC CCL171) , T24 (ATCC HTB 4), VERO (ATCC CCL81) and 
D17 (ATCC CCLil83) were purchased from ATCC. 

20 HQS, TE671 and Mv-l-Lu cells and their clones harboring 

MFGnlslacZ retroviral vector as described by Takeuchi et 
al., J Virol (1994), 68, 8001-8007. 

The above cell lines were grown in DMEM (Gibco-BRL, U.K.) 
25 supplemented with 10% fetal calf serum. 

EBB (Battini et al . , J. Virol (1992) 66: 1468-1475); 
psiCRE, psiCRELLZ and psiCRIP (Danes et al . , Proc . Natl. 
Acad. Sci USA (1988) 85: 6460-6464) ; 
30 Cells GP+EAM12 (Markowitz et al., Virology (1988), 167, 400- 

406) ; and 

NIH-3T3 murine fibroblasts. 

These cell lines .were grown in DMEM (GIBCO-BRL, U.K.) 
3 5 supplemented with 10% new-born calf serum. 
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Mv-l-Lu, TE671 and HT1080 cells were transfected using 
calcium-phosphate precipitation method {Sambrook et . , 
"Molecular Cloning" 1989, Cold Spring Harbour Laboratory 
Press: New York) as described elsewhere (Battini et al . , 
5 supra) . CeB- transfected Mv-l-Lu, TE671 and HTIOBO cells 

were selected with 3, 6-8 and' 4 ng/ml of blasticidin S (ICN, 
UK) , respectively, and blasticidin-resistant colonies were 
isolated 2-3 weeks later. Cells transfected with the various 
env-expression plasmids were selected with phleotnycin 
10 (CAYLA, France) : 50 fig/ml (for FBASALF- transfected cells) or 

10 fig/ml (for FBASAF- , FbdelPASAF-, FbdelPMOSAF, 
FBdelPlOAISAF or FBdelPRDSAF- transfected cells) '. Phleomycin- 
resistant colonies were isolated 2-3 weeks later. 



15 



20 



30 



35 



Production of lacZ pseudotypes using replication competent 
viruses, amphotropic murine leukemia virus (MLV-A) 1504 
strain and cat endogenous virus RDll4 , was carried out as 
described previously (Takeuchi et al . , j Virol (1994), 68, 
8001-8007) . 

Exan^le 2 



Preparation of Plasmids. * 
The env gene of pCRIP (Danes et al . , supra) was excised by 
25 Hpal/Clal digestion. A 500 bp PCR-generated DNA fragment, was 

obtained using pSV2-bsr (Izumi et al . , Experimental Cell 
Research (1991), i97, 299-233) as template and a pair of 
oligonucleotides: 

(5 ' >CGGAATTCGGATCCGAGCTCGGCCCAGCCGGCCACCATGAAAACATTTAACATTTC 
TO (SEQ ID NO 2) at 5 ' end and 

(5'>GATCCATCGATAAGCTTGGTGGTAAAACTTTT) (SEQ ID No 3 ) at 3 ' 
end, with Sfil and Clal sites, respectively. This fragment 
was inserted in Hpal/Clal sites of pCRIP by co-ligation with 
a 85 bp Hpal/Sfil DNA fragment isolated from pOXEnv (Russell 
et al.. Nucleic Acids Research (1993), 21, 1081-1085) which 
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provides the end of the Moloney murine leukemia virus* 
(MoMLV) pol gene. The resulting plasmid named CeB (Fig. i) 
. could express the MoMLV gag-pol gene as well as the bsr 
selectable marker conferring resistance to blast icidin S, 
5 both driven by the MoMLV 5'LTR promoter. 

A series of env-expression plasmids was generated using the 
4070A MLV (amphotropic) env gene (Ott et al.,, j Virol 

(1990) , 64, 757-766) and the FB29 Friend MLV promoter 

10 (Ferryman et al . , Nucleic Acid Res (1991), 19, 6950). In 

FBASALF (Fig. 1) a Bglll/Clal fragment containing the env 
gene was cloned in BamHI/Clal sites of plasmid FB3LPh which 
also contained the C5 7 Friend MLV LTR driving the expression 
of the phleo selection marker. A 13 6 bp env fragment was 

15 generated by PGR using plasmid FB3 (Heard et al . , j Virol 

(1991) , 65, 4026-4032) as template and a pair of 
oligonucleotides: ( 5 ' >GCTCTTCGGACCCTGCATTC ) (SEQ ID NO 4) at 
5' end (before Clal site) and 

(5' >TAGCATGGCGCCCTATGGCTCGTACTCTATAGGC) (SEQ ID NO 5) at 3' 

20 end, providing a KasI restriction site immediately after the 

env stop codon. This PGR fragment was digested using Clal 
and KasI, A DNA fragment containing the FB29 LTR and the 
MLV-A env gene was obtained by Ndel/Clal digestion of 
FBASALF. The fragments were co-ligated in Ndel/KasI digested 

25 pUT626 (kindly provided by Daniel Drocourt, CAYLA labs, 

France) . In the resulting plasmid, named FBASAF (Fig. i) , 
the phleo selectable marker was expressed from the same mRNA 
as the env gene. A Bglll restriction site was created after 
the MscI site at position 214 in the FB29 leader by using a 

30 commercial linker (Biolabs, France) . A Ndel/Bglli fragment 

containing the FB2 9 LTR was co- inserted with the Bglll/Clal 
env fragment in Ndel/Glal -digested FBASAF plasmid DNA, 
resulting in plasmid FBdelPASAF (Fig. 1) . Compared to 
FBASAF, FBdelPASAF has a lOObp larger deletion in the leader 

3 5 region. 
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Example 3 



Cloning and Sequencing of the RD114 env gene • 
The RD114 env gene was first sub-cloned in plasmid 
5 Bluescript KS+ (Stratagene) as a 3 Kb Hindlll insert 

isolated from SC3C, an RD114 infectious DNA clone (Reeves et 
al., J. Virol (1984), 52. 164-171). A 2 . 7 kb Seal-Hind 
fragment of this subclone containing the RD114 env gene was 
sequenced (Figure 4 (SEQ ID NO 1)- EMBL accession number; 
10 X87829) . The 5' non-coding sequence upstream of an Ndel site 

was deleted by an EcoRI/Ndel digestion followed by fillihg- 
in with Klenow enzyme and self -ligation . From this plasmid, 
two DNA fragments were obtained: a BamHl/NcoI 2 . 5 Kb 
fragment and a 63 bp PCR-generated DNA fragment using 
15 (5'>CGCCTCATGGCCTTCATTAA) (SEQ OD NO 6) at 5' end (before 

NotI site) and (5 ' >TAGCATGGCGCCTCAATCCTGAGCTTCTTCC) (SEQ ID 
NO 7) at 3' end, providing a KasI restriction site just 
after RD114 env gene stop codon. The PCR fragment was 
digested with Ncol and KasI. Both fragments were co- 
20 inserted between Bglll and KasI sites of FBdelPASAF and the 

resulting plasmid was named FBdelPRDSAF (Fig. 1) . 

Plasmid pCRIPAMgag- (Danos, O. et al., Proc Natl Acad 
Sci USA (1988) 85:6460-6464) was used for transf ect ion . 

25 Example 4 

Infection assays . 

Target cells were seeded in 24-multiwell plates (4x10^ cells 
per well) and were incubated overnight. Infections were then 

30 carried out at 37°c by plating 1 ml dilutions of viral 

supernatants in the presence of 4 ^g/ml polybrene (Sigma)' on 
target cells. 3h later virus -containing medium was replaced 
by fresh medium and infected cells were incubated for two 
■days before X-gal staining, performed as previously 

35- . described (Tailor et al . , J Virol (1993), 67, ' 6737-6741 , 
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Takeuchi et al . , J Virol (1994), 68, 8001-8007). Viral 
titers were determined by counting lacZ-positive colonies as 
previously described (Cosset et al , , j. Virol. (1990) 64: 
1070-1078). Stability of lacZ pseudotypes in fresh human 
5 serum was examined by titrating surviving virus after 

ll^' incubation in 1:1 mixture of virus harvest in serum-free 

medium and fresh human serum for 1 h at 37^C as described 
before (Takeuchi et al. supra) . 



10 Example 5 

Reverse transcriptase (RT) assay. 

RT assays were performed either as described previously 
(Takeuchi et al . supra) or using an RT assay kit (Boehringer 
15' Mannheim, U.K.) following the manufacturer's instruction but 

using MnCls (2 raM) instead of MgCl2 . 



Example 6 

20 Screening producer cell lines. 

Viral particles generated with RD114 envelopes have been 
found to be more stable in human serum than virions with 
MLV-A envelopes and that the producer cell line also 
controls sensitivity (Takeuchi et al . supra) . A panel of 

25 cell lines was screened for their ability to produce high 

titer viruses and for the sensitivity of these virions to 
human serum. To do this, cells were infected at high 
multiplicity with lacZ pseudotypes of either MLV-A or RD114 
and cells producing helper -positive lacZ pseudotypes were 

30 established. Human HT1080 and TE671 and mink Mv-l-Lu cells 

were found to release high titer lacZ(RD114) and lacZ (MLV-A) 
viruses. LacZ (MLV-A) pseudotypes produced by HT1080 cells 
were more resistant to human serum than those produced by 
other cells. The titer of these viruses was only -four- fold 

35 less following a 1 hr incubation with human serum than a 
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control incubation (Table 1). LacZ(RD114) pseudotypes' 
produced by human cells or mink Mv-l-Lu cells were in 
general stable in human serum (Table 1) . These results 
suggested that HT1080, TE671 and Mv-l-Lu cells provided the 
best combination of high lacZ titers and resistance to human 
serum and they were therefore used for the generation of 
retroviral packaging cells. 

Table 1. Titer and stability of lacZ pseudotypes . 



Producer 
cell 



LacZ (MliV-A) 



Titer* 



LacZ (RDii^i) . 



Stability** Titer^ 



Stability** 



A2 04 


650 


<3 


1,200, 


105 


HeLa 


9 


nd 


2 , 000 


115 


HOS 


4, 500 


6 


23 , 000 


86 


HT1080 


2 , 000, 000 


26 


400, 000 


129 


MRC-5 


450 


10 


1,000 


nd 


T24 


350 


nd 


1,200 


nd 


TEG 71 


15, 000 


2 


90, 000 


38 


VERO 


260 


nd 


90 


nd 


D17 


900 


<1 


200, 000 


1 


Mv-l-Lu 


80, 000 


1 


200, 000 


120 


a: titration on TE671 cells as lacZ 


i .u - /ml 






b: % of infectivity 


of human serum- 


treated viruses 


compared to 


fetaX calf serum 


treated viruses 











Example 7 



Construction of an improved gag-pol expression vector, 
.A MoMLV gag-pol expression plasmid, CeB (Fig. i) , was 
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derived from pCRIP (Danos et al . , Proc . Natl. Acad Aci USA 
(1988) 85: 6460-6464). Approximately 2 Kb of env sequence 
were removed from pCRIP and the bsr selectable marker, 
conferring resistance to blasticidin S (Izumi et al.. 
Experimental Cell Research (1991) 197:229-233), was inserted 
74 nts downstream of the gag-pol gene. This 74 nts interval 
had no ATG triplets and was thought to provide an optimal 
distance between the stop codon of the pol reading frame and 
the start codon of the bsr gene to allow re- initiation of 
translation (Kozak Mol Cell Biol., 1987, 7: 3438-3445). 
There was no "Kozak" consensus sequence (Kozak Cell, (1986) 
44: 283-292) at th^ 5' end of the marker gene. Therefore, 
bsr could only be expressed by re-initiatiori of translation 
after the upstream gag-pol gene had been expressed. 
15 Consequently, after transfection of CeB in Mv-i- 

Lu/MFGnlsLacZ (ML), TE671/MFGnlsLacZ (TEL) orHTlOBO cells, 
blasticidin S -resistant bulk populations and most cell 
clones expressed high levels of gag-pol proteins assessed by 
the reverse -transcriptase (RT) activity found in cell 
20 supernatants (Table 2) . Considerably higher RT activities 

were found in bulk populations of CeB-transf ected ML cells 
compared to bulk population of ML cells stably transf ected 
with the parental pCRIP construct. Similarly the RT 
activities of two packaging cell lines generated using 
25 pCRIPenv- construct, psiCRE cells (Danos et al . , supra) and 

EB8 cells (Battini supra.) were less than that of CeB 
transf ected clones (Table 2). Finally, RT activitiy in CeB 
transfected cell supernatants was higher than that of cells 
chronically infected by replication-competent MLV-A (Table 
30 2) . 

Table 2 . Secreted reverse transcriptase expression 



Cell^ RT activity*' LacZ Titer^ 



35 
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MT, /MTA7- a 


1 


SxlC* 




0 , 1 


<1 




0 . 15 


nd 


MLiCeB iDUlk) 


1 . 7 


nd 




4 . 2 


ixio^ 


IS/IT /^ri'D /I 


1 . 6 


IxlO^ 


T /MT TT 71 

IriLj/jyiLjV-A 


3.6 


2x10^ 




5 .2 


4x10^ 


rl J. X U o U / JYIij V - A 


1 . 1 


1x10^ 


HTCeB6 


1.9 


1x10^ 


HTCeBia 


2.7 


2x1 0« 


HTCeB22 (FLY) 


6.9 


5x10^ 


HTCeB4 8 


5.5 


3x10^ 


EB8 


0 .22 


1x10^ 


psiCRE-LLZ 


1.2 


1x10^*^ 



a: ML, Mv-l-Lu cells harboring a MFGnlslacZ provirus; TEL. TE671 cells harboring 
a MFGnlslacZ provirus; /MLV-A, cells chronically infected- with MLV- A 1504 
strain; MLSvB, ML cells transfected with a plasmid pSV2bsr alone; MLCRIP. ML 
cells CO- transfected with pCRiP and pSV2bsr. 

b: Average of arbitrary units relative to ML/MLV-A RT activity of at least tWo 
independent experiments was shown. The standard errors did not exceed 20 % of 
the values . 

c: titration on TE671 cells as lacZ i.u./ml. After polyclonal transfection 'at a 
plasmid which expresses MLV-A env in MLCeB clones, TELCeB clones, HTCeB clones 
and EB8 cells; nd, not done, 
d: titration on NIH3T3 cells 

To rescue infectious lacZ viruses, MLCeB and TELCeB clones 
were transfected with FBASALF DNA, a plasmid designed to 
express the MLV-A env gene (Fig. 1) . Bulk populations of 
stable FBASALF transf ectants were isolated and supernatants 
were titrated using TE671 cells as targets. Titers of lacZ 
viruses were higher than either MLV-A infected ML or TEL 
35 cells, or FBASALF- transf ected EB8 cells (Table 2) . These 

data suggested that CeB was an extremely efficient MLV gag- 
.- pel expression vector in mink Mv-l-Lu and TE671 cells. CeB 
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was therefore used to derive packaging cells by transfection 
of HT1080 cells. 41/49 blasticidin S-resistant colonies had 
detectable levels of RT; 9 had RT activity higher than that 
of control MLV-A- infected HT1080 cells (data not shown) . 
5 Expression of gag precursor was confirmed in cell lysates 

and supernatants of these 9 HTCeB clones by immunoblotting 
using antibodies against p30-CA (data not shown) . The 4 
clones with the highest expression of gag proteins (clones 
6,13,22 and 48) were infected at high-multiplicity with 

10 helper free, lacZ pseudotypes bearing MLV-A envelopes 

(MFGnlslacZ (A) ) produced by TELCeBS/FBASALF (Table 3) and 
then transfected with FBASALF. Supernatants of bulk, 
phleomycin-resistant transf ectants were assessed for RT 
activity and lacZ titer (Table 2) . Clone HTCeB22, named FLY, 

15 was found to be the best gag-pol producer clone and was used 

to introduce env expression vectors for the generation of 
packaging cell lines. 
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Table 3 . Titer following env constmct tiransf ection 

Producer cell Env source Titer^ 

psiCRIP lacZ 5 pCRIPAMgag- 6x10'*'* 

GP+EAM12 lacZ 25 envAM 3x10^** 

10 TELCeBG FBASALF^ 5x10" 

FBASAF^ 2x10" 

FbdelPASAF^ 2x10" 



7 



TELCeBS FBdelPASAF 1 3x10" 

FbdelPASAF 4 2x10 

FbdelPASAF 6 1x1 O" 

FbdelPASAF 7 5x10" 

FbdelPASAF 8 1x1 0" 

FbdelPRDSAF 2 1x10^ 

2^ FbdelPRDSAF 4 3x1 0^ 

FbdelPRDSAF 7 1x1 O" 

FbdelPRDSAF 8 2x10^ 

FLY^ FBdelPASAF 1 1x10^ 

25 FbdelPASAF 4 1.5x10^ 

FbdelPASAF 5 1x10^ 

FbdelPASAF 7 1x10^ 

FbdelPASAF 13 7x10^ 

FbdelPASAF 14 4x10^ 

FbdelPASAF 15 1x10^ 

FbdelPASAF 16 5x10^ 

FbdelPASAF 17 6x10^ 

FLYA4 lacZ 3 FBdelPASAF 4 2x10"**. 

FLY** FBdelPRDSAF 1 2.5x10^- 

FbdelPRDSAF 2 1x10" 

FbdelPRDSAF 6 5x10^ 

FbdelPRDSAF 10 2x10^ 

FbdelPRDSAF 11 3x10^ 

FbdelPRDSAF 13 1x10^ 

FbdelPRDSAF 17 5x10^ 

FbdelPRDSAF 18 3x10" 

FbdelPRDSAF 19 6x10^ 



45 



50 



Average titers of at least three independent experiments were shown. The 

standard errors did not exceed 30 % of the titer values. 

a: titrated on TE671 cells as lacZ i.u./ml 

b: results of best MFGnlslacZ producer clones. 
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c: bulk populations of env- transf ectants in TELCeBo cells. 

d: titration after bulk infection with helper-free MFGnlslacZ. 

Example 8 

Cons txoict ion of env expression vectors. 
A series of MLV-A env expression plasmids were then 
generated (Fig. i) . m FBASALF, the env gene was inserted 
between two Friend- MLV. LTRs , its expression driven by the 
FB29 MLV LTR {Ferryman et al . , supra). Most of the packaging 
signal located in the leader region was deleted. This 
plasmid also expressed the phleo selectable marker (Gatignol 
et al., supra) driven by the 3' LTR. FBASAF and FBdelPASAF 
were then designed following the same strategy used for CeB. 
These two vectors differed only by the extent of deletion of 
the packaging signal, FBdelPASAF having virtually no leader 
sequence. Compared to pCRIPAMgag- and pCRIPgag-2 env 
plasmids expressed in psiCRIP or psiCRE packaging cells 
(Danos et al . , supra) about 5 Kb of gag-pol sequences was 
removed. In addition the 258 bp retroviral sequence 
containing the end of env gene and the begining of U3 found 
in pCRIPAMgag- and pCRIPgag-2 was also removed. For both 
FBASAF and FBdelPASAF plasmids, the phleo selectable marker 
was inserted downstream of the env gene by positioning a 7 6 
nts linker with no ATG codons between the two open-reading 
frames. Phleo could therefore only be expressed by re- 
initiation of translation by the same ribosomal unit that 
had expressed the upstream env open reading frame. 
FBdelPASAF was also used to generate FBdelPRDSAF, an RD114 
envelope expression plasmid (Fig. 1) . 

After transfection of the env plasmids into TELCeBG cells 
(Table 2) , bulk populations of phleomycin-resistant colonies 
were isolated and their production of lacZ virus measured 
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(Table 3) . FBASALF gave a titer of 5x10^ lacZ- i . u . /ml / whilst 
titers with either FBASAF or FBdelPASAF were 2x10' lacZ- 
i.u./ml (Table 3). Titers of 5x10' or 10' lacZ-i .u. /ml could 
be obtained with some FBdelPASAF cell clones or FBdelPRDSAF 
clones, respectively. 

As FBdelPASAF has minimal virus -derived sequences and was 
shown to be the safest construct (see below and Table 4) 
it and FBdelPRDSAF were used to generate packaging lines' 
from FLY cells (clone HTCeB22, Table 2). Envelope expression 
of these clones was assayed by interference to challenge • 
with MFGnlslacZ(A) or MFGnlslacZ(RD) pseudotypes produced- by 
TELCeB6/FBdelPASAF-7 or TELCeBS/FBdelPRDSAF-7, respectively 
(Table 3). The cell lines showing most interference were 
cross-infected at high multiplicity with these pseudotypes 
to provide MFGnlslacZ proviruses, and supernatants were then 
titrated on TE671 cells (Table 3) . FLY-FBdelPASAF-i3 (FLYA13 
packaging line) and FLY- FBdelPRDSAF- 18 (FLYRDl 8 packaging 
line) gave the highest productions of lacZ viruses, around 
20 10' lacz-i.u./ml. The best MFGnlslacZ producer clones derived 

from either psiCRIP cells (Danos et al . , supra) or GP-^EAM12 
cells (Markowitz et al., supra) gave approximately 50 fold 
lower titers (Table 3). The lacZ titers of the FLY-derived 
lines shown in Table 3 are lower than the best TELCeB6- 
derived lines after transfection of either FBdelPASAF or ' 
FBdelPRDSAF (Table 3) . However it should be noted that the 
lacZ provirus expressed in TELCeB6 cells was obtained after 
clonal selection but was introduced polyclonally in FLY- 
derived env-transfected cell clones. When FLY -FBdelPASAF -4 
cells (FLYA4 packaging line), infected with helper-free 
MFGnlslacZ (RD), were cloned by limiting dilution the best' 
clones (eg. FLYA41acZ3) were found to produce 20 times more 
infectious viruses than the bulk population, reaching the 
range of titers obtained with the best TELCeBS- FBdelPASAF- " 
35 clones (Table 3) . 
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Example 9 

Assays for transfer of gag-pol or env functions. 
To assay for replication-competent viruses, supernatants 
5 were used to infect TEL cells (a clone of TE671 cells 

harboring an MFGnlslacZ provirus) . Infected cells were 
passaged for 6 days or longer and their supernatants were 
used for infection of fresh TE671 cells. No transmission of 
lacZ viruses could be detected {Table 4), demonstrating that 

10 the supernatants of pCRIPAMgag- - , FBASALF-, FBASAF- , or 

FBdelPASAF-transf acted TELCeB6 cells were helper-free. 
Similar absence of replication competent recombinant 
retroviruses was demonstrated using supernatant from a 
clone of psiCRIP-MFGnlslacZ cells or from two clones of 

15 FLYA- MFGnlslacZ cells (Table 4) . 

There have been reports that helper- free retroviral vector 
stocks may nevertheless contain recombinant retroviruses 
(replication incompetent) carrying either gag-pol or env 

20 genes (Bestwick et al . , Proc Natl Acad Sci USA (1988), 85, 

5404-5408, Cosset et al.. Virology (1993), 193, 385-395, 
Girod et al . , Virology (1995), in press) . To assay for such 
recombinant retroviruses, mobilisation of an MFGnlslacZ 
provirus from two indicator cell lines which could cross - 

25 complement potential recombinant viruses carrying either 

gag-pol or env functional genes was attempted. The TELCeB6 
line (Table 2) expressing gag-pol proteins was used as 
indicator cell line to test for the presence of env 
recombinant (ER) viruses. The TELMOSAF indicator line 

3 0 expressing MoMLV env glycoproteins (obtained by transfection 

of FBMOSAF, a plasmid expressing the MoMLV env gene using 
FBASAF backbone, in TEL cells) was used to detect the 
presence of gag-pol recombinant retroviruses (GPR viruses) 
After passaging 4-8 days, the supernatants of the infected 

35 indicator cells were used to infect either human TE671 cells 
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or murine NIH3T3 cells. 



TELCeB6 cells transfected with various env-expressing ' 
constructs, pCRIPAMgag-, FBASAF and FBdelPASAF were 
compared. Although the supernatants of TELCeB 6 -FBdelPASAF 
cells were devoid of replication-competent retroviruses:, 
they were found sporadically to transfer gag-pol genomes 
(Table 4) . No GPR viruses could be detected when less than 
2x10^ virions were used to Infect the indicator cells. 
Similarly TELCeBS indicator cells infected with various 
helper- free viruses were shown sporadically to release lacZ 
virions (Table 4) . The number depended both on the env- : 
expression vector used and on the virus input quantity. 
Compared to lacZ viruses generated using pCRlPAMgag- 
plasmid, the . frequency of detection of the env- recombinant 
viruses was lower for supernatants generated by using FBASAF 
and FBdelPASAF constructs (Table 4) . For FBdelPASAF • 
construct when less than 5x10^ MFGnlslacZ (A) helper-free 
virions were used to infect the indicator cells, no ER 
retroviruses could be detected. From these experiments, .^t 
could be estimated that a supernatant, produced from 
TELCeBS -FBdelPASAF cells, containing 1x10' infectious units 
of MFGnlslacZ retroviral vector contained no replication- 
competent virus, and about 100 gag-pol and 100 env " 
recombinant retroviruses. 
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Table 4. Transfer of packaging function 

Producer cell Indicator cell input virus'^ Detectioif 

(lacZ-i.u.) + -h H- 



Replication competent virus 





psiCRIP lacZ 5 


TEL 


2x10" 


0/4 


0/4 


.4/4 


10 


TELCeB6-pCRIPAMgag- 


TEL 


5x10^ 


0/4 


0/4 


4/4 


TELCeB6-FBASAF 


TEL 


5x10^ 


0/4 


0/4 


4/4 




TELCeB6-FBdelPASAF 


TEL 


5x10* 


0/4 


0/4 


4/4 


15 


FLYA4 lacZ 3 
FLYA4 lacZ 7 


TEL 
TEL 


1x10' 
1x10' 


0/4 
0/4 


0/4 
0/4 


4/4 

4/4 






Gae-Dol recombinant 








20 


TELCeB6-FBdeIPASAF 7 
TELCeB6-FBdeIPASAF 7 
TELCeB6-FBdelPASAF 7 
TELCeB6-FBdelPASAF 7 


TELMOSAF 
TELMOSAF 
TELMOSAF 
TELMOSAF 


2x10' 
2x10^ 
2x10^ 
2x10* 


0/4 
0/4 
0/4 
0/4 


1/4 
2/4 
2/4 
0/4 


3/4 
2/4 
2/4 
4/4 






Env recombinent 








25 


TELCeB6-pCRIPAMgag- 
TELCeB6-pCRIPAMgag- 
TELCeB6-pCRIPAMgag- 


TELCeB6 
TELCeB6 
TELCeB6 


5x10* 
5x10^ 
5x10* 


2/4 
1/4 
0/4 


1/4 
1/4 
2/4 


1/4 
2/4 
2/4 


30 


TELCeB6-FBASAF 
TELCeB6-FBASAF 
TELCeB6-FBASAF 


TELCeB6 
TELCeB6 
TELCeB6 


5x10^ 
5x10^ 
5x10* 


0/4 
0/4 
0/4 


2/4 
1/4 
1/4 


2/4 
3/4 
3/4 


35 


TELCeB6-FBdelPASAF 
TELCeB6-FBdelPASAF 
TELCeB6-FBdelPASAF 


TELCeB6 
TELCeB6 
TELCeB6 


5x10^ 
5x10^ 
5x10* 


0/4 
1/4 
0/4 


1/4 
3/4 
0/4 


3/4 
0/4 
4/4 




a: number of lacZ i.u. used 


to infect indicator 


cells 









b: number of incidence out of four experiments. The ranges of lacZ titers 
rescued from infected indicator cells are shown for each virus input : 
lacZ i.u. /ml (+•*•), 1-100 lacZ i.u. /ml (+) and <l lacZ i.u. /ml (-). 

Titers were determined on TE671 cells for replication 
competent virus and env recombinant and NIH3T3 cells for 
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gag-pol recombinant. 



Example 10 

In order to confirm resistance to complement and absence of 
replication competent virus in our best packaging lines " 
MFGnlslacZ (A) and (RD) harvested from FLYA13 arid FLYRD18 
respectively, after polyclonal transduction of MFGnlslacZ 
(Table 3 above) were tested for stability in fresh human 
serum and generation of replication competent virus. Titers 
of MFGnlslacZ (RD) from FLYRD18 after 1 hr incubation with 3 
independent samples of fresh human serum were 8 0 to 120. ^ 
of control incubations, while titers of MFGnlslacZ (A) from 
FLYA13 were 50 to 90 % of controls (data not shown) . No 
replication competent virus was detected in the same assay 
15 described above (Table 4) when l x lO^ i.u. each of 

MFGnlslacZ (A) and (RD) were tested. 



EXAMPLE 11. 



2 0 Generation of plasmids. 

CeB plasmid (Fig. 5) expressing MoMLV gag-pol gene, was 
further modified to remove the splice donor site located in 
the leader region. A 272 bp fragment was PCR-generated by 
using OUSD- (5 ' -TCTCGCTTCTGTTCGCGCGC) and OLSD- 
(5'-TCGATCAAGCTTGCGGCCGCGGTGGTGGGTCGGTGGTCC) as primers and 
further digested with BssHIl and Hindlll. A -1008 bp 
Hindlli-xhol fragment isolated from CeB (encompassing a part 
of leader sequence and beginning MoMLV gag) and the PGR 
fragment were co-inserted into pCeB from which the 1275 bp 
BssHII-XhoI fragment (encompassing R-U5-leader-gag) had been 
removed. The resulting- plasmid, named pCeB DS- (F^g 5) 
beared the deletion of splice donor (SD) site and a Noti 
restriction site created just downstream to the lost SD 
site. 
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A series of gag-pol expression plasmids in which the MoMLV 
LTR promoter was replaced by the human cytomegalovirus 
immediate early promoter (hCMV promoter) was derived from 
both CeB DS- and hCMV-G (Yee et al . , 1994 PNAS, 91: 9564- 
5 9568), a plasmid used as a source for the hCMV promoter • A 

Notl-filled/EcoRI 7260 bp fragment was isolated from CeB DS- 
and cloned into hCMV-G which had been opened with Sail 
(further rendered blunt-ended) and EcoRI to remove the VSV-G 
gene. The resulting plasmid was cutted with Clal and EcoRI 

10 to remove a 1155 bp fragment encompassing sequence derived 

from 3'-LTR and SV40 polyA sequence and self-ligated after 
filling both protruding DNA ends. The resulting plasmid, 
named phCMV-intron (Fig, 5) , had gag-pol and bsr ORFs 
inserted between the CMV promoter and rabbit beta-globin 

15 polyA post-transcriptional regulatory sequences. 

An intermediate plasmid was generated by sub- cloning a 72 60 
bp EcoRI fragment (isolated from CeB DS-) into hCMVG opened 
with EcoRI. A 1155 bp fragment (encompassing sequence 

20 derived from 3'-LTR and SV40 polyA sequence) was removed 

from this intermediate plasmid which was then 
re-circularized by self ligation after filling both ends. 
The resulting plasmid, named phCMV+intron 2P (Fig. 5), was 
digested with NotI and the vector was treated with klenow 

25 enzyme. A 1440 bp fragment (encompassing hCMV promoter and 

rabbit beta-1 globin intron B (Rohrbaugh et al . , 1985 Mol . 
Cell Biol, 5: 147-160)) was isolated from phCMV+intron 2P by 
Notl/EcoRI digestion. This fragment was further treated with 
klenow enzyme and ligated back into the vector. The 

3 0 resulting plasmid, named hCMV+intron (Fig. 5.) , could express 

gag-pol and bsr genes driven by the hCMV promoter and beared 
an intron sequence derived from rabbit beta-1 globin intron 
B having both SD and SA (splice acceptable) sites. 

35 A 2450 bp fragment was removed from phCMV+intron 2P by 
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Notl/Xhol digestion. The resulting vector fragment was then 
used to co-ligate a 1330 bp fragment (containing hCMV " 
promoter +5' end of rabbit beta-l globin intron B (with SD 
site)) isolated from phCMVG by Apal- filled/Not I digestion 
and a 1 kb fragment isolated from phCMV+intron 2P by i 
Notl-filled/XhoI digestion. Compared to phCMV+intron 2P j the 
resulting plasmid, named hCMV+SD intron (Fig. 5), had the 
deletion of the 3' end of the rabbit beta-l globin introh B 
and thus no SA site in the leader region. 

Construct phCMV.leader (Fig. 5) has been described elsewhere 
(Savard et al . , unpublished). This plasmid, in which gagrpol 
and bsr genes were driven by the hCMV promoter, had the •• 
MoMLV SD site in the leader region. 

Gag-pol expression. 

The different constructs, including the parental CeB 
plasmid, were analysed comparatively in a complementation 
assay after transfection in TEL-FBdelPASAF cells expressing 
4070A-MLV (amphotropic) envelope and harboring a MFGnlslicZ 
provirus. The transient production of lacZ retroviruses ^s 
well as the stable production of lacZ retroviral vectors I 
after selection with blasticidin S were determined . (Tabli . 
5) . All the constructs were able to rescue infectious la^Z • 
retroviruses indicating the expression of gag-pol proteils 
after transient transfection. Most likely due to the 
efficient hCMV and rabbit beta-l globin intron B 
(post) -transcriptional regulatory sequences, hCMV+ intron was 
particularly potent in transient retroviral vector 
production. However, lo times less blasticidin-resistant • 
colonies were obtained with hCMV+intron comparatively to' 
CeB, and stable lacZ virus production from hCMV+ intron was 
about 5-10 times lower than that of CeB. Clonal examination 
of lacZ retrovirus production from blasticidin-resistant' 
colonies indicated that 80-90% of colonies could express 
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high levels of gag-pol proteins for both hCMV+intron and CeB 
plasmids. In contrast, despite variation in their ability to 
form blasticidin-resistant colonies after transfection and 
despite their ability to express gag-pol proteins from 
transient transf ectants , all other constructs had a weak 
capacity for rescuing lacZ retroviral vectors from stable 
transf ectants (Table 5) . 



Table 5. Comparative study of gag-pol-bsr plasmids. 



10 



gag-pol -bsr 
plasmid 


Transient 
(lacZ 
i .u - /ml) 


no clones 
bsr* 


Stable 
(lacZ 

i . u . /ml 


^ gag-pol 
/bsr 


Ceb 


300/ml 


50 


10^ 


90% 


Ceb DS- 


144 /ml 


5 


10^ 


50% 


hCMV+intron 

2P 


ND 


20 


10^ 


50% 


hCMV-intron 


8 12 /ml 


0 






hCMV+SD 
intron 


150/ml 


1000 


10^ 


nd 


hCMV+leader 


328/ml 


1000 


10^-10^ 


nd 


hCMV+intron 


12000/ml 


5 


10^-10"' 


80% 



Northern blot analyses were performed on stable 
transf ectants (blasticidin-resistant) obtained with some of 
the gag-pol-bsr plasmids. As expected, the results (not 
shown) displayed a correlation between expression of gag-pol 
mRNAs and gag-pol protein expression detected by rescue 
analysis (Table 5) . CeB construct was found to produce 2-3 
fold more gag-pol mRNAs compared to hCMV+intron , 
Interestingly, an unexpected 2.45 kb RNA band .was found for 
hCMV+intron construct at a ratio of 2:1 compared to the 
abundancy of the gag-pol mRNA band (at 5.95 kb) . Further 
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xnvestigations by using other probes revealed that a cryptic 
splice donnor (SD) site located in the gag gene (right in 
the middle of the CA coding region at position 1596-1597 ' 
-numbering according to Shinnick et al., 1981 Nature 
(London) 293: 543^548) was activated in this latter 
construct. The 2.45 RNA species, lacking the 3' half of the 
gag gene and most of the pol gene, is unlikely to give rise 
to any useful translational product. It is therefore 
interesting to notice that hCMV+intron construct was abl'e to 
gxve rise to slightly more transcripts (gag-pol 5.95 mRNA + 
2.45 alternative RNA band) compared to gag-pol mRNA 
expressed from CeB construct. Therefore we decided to 
inactivate the cryptic SD site in the hCMV+intron construct 
xn order to increase the ratio of gag-pol mRNAs . 

Assays for transfer of gag-pol functions. " 
Although the supernatants of pacakaging cell lines generated 
With CeB gag-pol expression contruct were devoid of 
replication-competent retroviruses, they were found 
sporadically to transfer gag-pol genomes (example 9, Table 

4) (Cosset et al . , 1995 J. virol 69: 7430-7436). Because 
gag-pol-bsr constructs generated here by using the hCMV ' 
promoter had much less retroviral sequences homologous- t-o 1 
the retroviral vector than the parental CeB construct (Fi^ 

5) , they are less likely to give rise to gag-pol recombinant 
(GPR) viruses. Therefore, the most efficient gag-pol-bsr 
plasmids, hCMV+intron and CeB, were further analysed for 
emergence of GPR viruses. To assay for such recombinant 
retroviruses, we attempted to mobilise an lacZ provirus from 
an indicator cell lines which could cross -complement 
potential recombinant viruses carrying gag-pol functional ' 
genes. Results displayed in Table 6 showed that consistently 
with data reported previously (example 9, Table 4) (Cosset et 
■al.^ 1995 Supra), lacZ retrovirus vectors generated by using 
CeB gag-pol construct were contaminated with GPR viruses. In 
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contrast lacZ retrovirus vectors generated by using 
hCMV+intron construct were completely devoid of such GPR 
viruses, suggesting that this construct was "improved 
compared to CeB with respects with emergence of recombinant 
5 viruses. 



Table 6. Comparative study of gag-pol-bsr plasmids 



10 



plasmid 


input virus 
(lacZ i.u. )^ 


no of experiments 
giving titres of^ 


CeB 


5x10^ 


5 


3 


0 




5x10^ 


2 


4 


2 




5x10^ 


0 


1 


7 


hCMV+intron 


5x10^ 


0 


0 


8 




5x10^ 


0 


0 


8 




5x10^ 


0 


0 


8 



4X10E4 cells of TEL/MOSAF in 24 wells were challenged with lacZ (A) of i.u. 
indicated in the table (a), and incubated at 37°C for 3 days. Cells were 
trypsinized and transferred into small flasks. Cell sup was harvested on day 5 
after lacZ (A) challenge and plated on either TE571 (not shown) and 3T3 cells 
{b) . No lacZ was mobilized into TE671 at all. LacZ (A) from CMV-int 10 again 
did not rescue lacZ from TEL/MOSAF. 

Example 12 

Generic primers to detect D-type (Medstrand and Blomberg 
J.Virol. (1993) 67:6778-6787) , C-type (Shihet al . , J 
Virol. (1989) 63:64-75), human endogenous virus RTVL-H 
(Wilkinson et al . , J.Virol. (1993) 67:2981-2989), by RT-PCR 
were employed (Patience et al., supra). Primers to detect 
mouse endogenous VL3 0 element (Adams et al Mol . Cel . Biol . 
(1988) 8:2989-2998), and MFGnlslacZ RNA were designed and 
synthesized (TABLE X) . Overnight supernatant s (in 4ml of '.. 
culture medium) from 106 cells of GP+EAM12lacZ25 , FLYA4lacZ3 
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and TELCeBSFBASALF cells (Table 3) were harvested and' 
centrifuged in sucrose gradient as described previously 
(Patience et al., J.Virol., 70:2654-2657). Fractions 
containing retrovirus particles were collected, and rna 
extracted. One twentieth of the RNA preparation or 
dilution's thereof were applied to RT-PCR as described 
previously (Table X) . A 1/200 of RNA harvested from 
GP+EAM121acZ25 cells was positive for VL30 RNA. MFGnlslacZ 
RNA was found from 1/20 of RNA from GP+EAM121acZ and 
TELCeB6FBASALF cells and 1/200 of RNA from FLYA4lacZ3 cells 
The primer combinations for RTVL-H, C- and D-type RNA did 
not give detectable PGR product. 

Table 7. RT-PCR detection of endogenous retrovirus RNA 
15 associated with vims particles. 



2 0 RNA 



rt-pcr of virion associated RNA from* 

primer (5' -3') GP+EAM12 FLyA4 TELCeBSF 

forward (F) /reverse (R) lacZ25 lacZ3 BASALF 



2 5 lacZ 



MFGnls F) CTCTGGCTCACAGTACGACGTAG 



R) CCATCAATCCGGTAGGTTTTCCG 



++ 



C- type F) CARRGKTTCAARAACWSYCCCAC 

R) AGYARVGTAGCNGGGTTHAGG 

D-type F) TCCCCTTGGAATACTCCTGTTTTYGT 

R ) CATTCCTTGTGGTAAAACTTTCCAYTG 

RTVL-H F) CCTCACCCTGATCACRYTTG 

^ R ) GAATTATGTCTGACAGAAGGG 

^VL3 0 F) GTTGACATCTGCAGAGAAAGACC 

R) TCTGAGGTCTGTACACACAATGG 



NT 



+ + NT NT 
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a:-, not detected; + detected in 1/20 RNA preparation; ++ detected in 1/200 RNA 
preparation; NT, not tested because the cells do not possess the corresponding 



20 



30 



genes . 



EXAMPLE 13. 



Generation of gag-pol pre-packaging cells by using TE671 
cells . 

10 CeB, a plasmid designed to over-express MoMLV gag and pel 

proteins was introduced in TE671 human rhabdomyosarcoma 
cells (ATCC CRL8805). After selection with blasticidin, 50 
bar-positive colonies were isolated and the RT (reverse 
transcriptase) activity was analysed in their supernatants. 
15 12 TE671-CeB (TECeB) clones with high RT activity were 

selected for further analysis. The best TECeB clone, clone 
#15, had a RT activity roughly equivalent to that TELCeB6 
cells (Cosset et al., J. Virol. 69:7430-7436 (1995); see 
also Example 7, Table 6 in this patent application) but 
displayed 2-3 fold more gag-precursors into cells as 
demonstrated in immunoblots by using anti-CA antibodies. The 
biological activity of gag-pol proteins expressed in the six 
best TECeB clones was further confirmed by their ability to 
produce infectious retroviruses in a complementation assay. 
25 A lacZ provirus was introduced into each of the TECeB clones 

by polyclonal cross-infection by using lacZ(RD114) helper- 
free retrovirus vectors. FBMOSALF, a MoMLV env expression 
plasmid (Cosset et al., J. Virol. 69:6314-6322), was then 
transfected in each of the TECeB-lacZ lines and in the 
TELCeBG cell line for comparison. After selection with 
phleomycin, the titer of lacZ retrovirus vectors was 
determined in the supernantant of pools of phleomycin- 
resistant colonies for each TECEB-lacZ-FBMOSALF lines. A 
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good correlation was found between gag-pol expression" into 
the TE-CeB clones (as determined by RT-assays and anti-gag 
xnm,unoblots) and their ability to release infectious lad 
particles. TE-CeB15 cells could release approximately the 
same number of lacZ particles when compared to TELCeB6 cells 
although TELCeB6 cells had the advantage of being selected 
for lacZ expression (Cosset et al., j. virol. 69: 7430--7436 
(1995)). TE-CeB15 cells were therefore used to derive ■ 
retroviral packaging cell lines. 

Construction of env-eatpression plasmids . 

A series of plasmid (Fig. 3) was designed to allow 

expression of different retroviral envelope genes (isolated 

from MoMLV, GALV -Gibbon Ape Leukemia Virus-, and MLV-lOAl) 

FBdelPMOSAF (Fig. 3, nucleotide sequence in -Fig. 10) and. 

FBdelPlOAlSAF, expressing ecotropic MoMLV or MLV-lOAl 

envelopes, were generated by replacing the Bglli/cial 

fragment from FBdelPASAF (Cosset et al . , J. virol. 69:7430- . 

7436 (1995); see also Example 7, Fig. 2 and nucleotide 

sequence in Fig. 9) encompassing most of the- env gene and 

splice acceptor site with that of MoMLV (position 5407 to^ 

767.9, Shinnik et al., 1981) or with that of MLV-lQAl (Otti et | 

al., J. Virol. 64:757-766 (1990)). • 

Nucleotides 7514-7516 of GALV (Delassus et al.. Virology' 

173:205-213 (1989)) were mutated by PCR-mediated mutagenesis 

to create a Clal site (AAG to CGA) , thereby introducing a. 

conservative modification (a lysine (amino-acid 665 of GALV 

env precursor) to an arginine) . The BamHI/Clal fragment (hts 

4994 (Delassus et al. Virology 173:205-213 (-1989)) to 7517) 

was then sub-cloned into FBdelPASAF in which the Bglll/cial 

encompassing most of the env gene and splice acceptor site 

had been removed. The resulting plasmid, expressing GALV ' 
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envelope glycoproteins, was named FBdelPGASAF (Fig. 3, 
nucleotide sequence in Fig. 11). 

CMVlOAl was generated by inserting a Klenow enzyme-filled 
Eagl/Sall fragment from FBdelPlOAlSAF (encompassing lOAl MLV 
5 env gene and phleo selectable marker) into hCMV-G digested 

.with BamHI and filled with Klenow enzyme. The resulting 
plasmid, CMVlOAl (Fig. 3 and nucleotide sequence in Fig. 13) 
could express lOAl envelopes under control of the hCMV 
promoter and the phleo selectable marker by translation re- 
10 initiation. 

Generation of a multi-tropic set of TE671-based retroviral 
packaging lines. 

FBdelPRDSAF (Fig. 3, nucleotide sequence in Fig. 12), 

15 FBdelPASAF, FBdelPGASAF, FBdelPMOSAF and FBdelPlOAlSAF were 

independently introduced into cells of the TE-CeB15 pre- 
packaging line, expressing MoMLV gag-pol proteins. 
Transfected cells were phleomycin-selected and 15-20 phleo- ■ 
resistant colonies were isolated for each env-expression 

20 plasmid transfected. 

Individual colonies were then analysed for expression of 
envelope glycoproteins by immunoblots on cell lysates by 
using antibodies against RD114 SU glycoproteins or against 
Rausher leukemia virus SU (to screen MoMLV, MLV-4070A and 

25 MLV-lOAl env-producer clones) or against GALV. The best env- 

producer colonies as determined in this assay were further 
analysed by a complementation assay after introducing a lacZ 
retroviral vector. LacZ pseudotypes released from the 
different packaging cell lines were titrated by using NTH 

30 3T3 cells or TE671 cells as target. Titers higher than 1x10^ 

lacZ i.u./ml were obtained for the best clones. Depending on 
the envelope specificities expressed in these cells, the new 
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TE671-based retroviral packaging cell lines were named TE- 
FLYE, TE-FLYA, TE-FLYRD, TE-FLYlOAl, and TE-FLYGA and coMd 
express the MoMLV, MLV-4070A, RD114, MLV-lOAl, and GALV Lv 
genes, respectively. • /. 

.Assays for detecting replication-competent retroviruses ^ 
(RCRs) were performed in the supernatants of these cells and 
were negative (less than l/ml) . 

TE671 cells are very potent for transient expression 
resulting in more than 95% of cells expressing transgene 
three days after plasmid transfection (Hatziioannou and ■ 
cosset, unpublished data, (1996) ) . The ability of retroviral 
packaging cell lines to transiently produce retroviral" 
vectors is of crucial importance for gene therapy where • 
vectors carrying toxic gene have to be prepared. Transient 
expression of retroviral vectors was comparatively 
determined from' cells of the TE-FLYA line and from the BING- • 
line (Pear et al., Proc Natl Acad Sci U S A 90, 8392-6 
(1993)), a retroviral packaging cell line designed to 
transiently express retroviral vectors. Results (Table 8) 
Showed that -TE-FLYA cells were more efficient for transient 
expression of a lacZ retroviral vector hence resulting in^ " 
higher titers. 



Table 8 
vectors . 



Coa5»arative study of transient production of lacZ 



packaging 
cell line 


cell nuxzibexr^ 


% transfected 
cells*' 


transient 

titer" 


BXKG 


281 


5.3 


2x10^ 


TE-EXYA 

Cells were transfecte 


117 

d by MFGnislacZ rerrr 


35 


1.3x10^ 
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supernacaat were determined as lacZ i.u./ml at day 3 following transf action. The 
relative number of cells (a) (average per microscope field) and the % of 
transfected cells (b) determined after X-gal staining are shown. 

Retroviral vectors prepared from TE671-based packaging cell 
lines were analysed for their sensitivity to human- 
complement mediated inactivation . Experiments were conducted 
as previously described (Cosset et al., J. Virol. 69:7430- 
7436 (1995); see also Example 10 in this patent application) 
by using three human sera of individual donnors (Table 9) . 
As expected MLV-A prepared from mouse 3T3 cells were highly 
sensitive to inactivation after 1 hr incubarion witn sera. 
In contrast, titers of lacZ vectors produced from TE-FLYRD 
cells were 17 to 55% of control incubations, while titers of 
lacZ vectors from TE-FLYA cells were 1 to 30% of controls. 



20 



25 



Table 9. Human serum sensitivity of vixiases produced from 
TE671-based packaging cell lines. 



Virus from: 


hu56' 


hu57» 


BTS" 


3T3/A 


<0.2, <0.2 


<0.2, <0.2 


<0.2, <0.2 


TE-FLYE 


15, 7.8 


16, 11 


48, 60 


TE-FLYA 


1, 0.6 


2.2, 7.1 


28, 19 


TE-FLYFD 


17, 22 


30, 44 


54, 63 



_ .j.Lii-'a.u.i-clut:: , iiu:30 {i\f ) , nUD / I 

ETS(AB+). (a) % control (average for FCS and opti-MEM treatment) is shown. 
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CLAIMS : 

1. A recombinanc expression vector comprising a gene of 
interest and a selectable marker gene, wherein the 
selectable marker gene is arranged downstream of th^ 
gene of interest and a st6p codon associated with the 
gene of interest is spaced from a start codon of said 
selectable marker gene at a distance which- is 
sufficient to ensure that said selectable' marker 
protein is expressed from the corresponding mRNA as a 
result of translation reinitiation. i 



2 . 



4 . 



6 . 



A recombinant expression vector according to claim 1 
wherein the vector is a viral vector. 



3. A recombinant expression vector according to claim 2 
wherein the vector is a retroviral vector. 



A recombinant expression vector according to any one 
of claims 1 to 3 wherein the gene of interest 
included as part of a viral packaging construct. 



is 



A recombinant expression vector according to any one! 
of the preceding claims wherein the number of 
nucleotides in the space between the stop codon of the' 
gene of interest and the start codon of the selectable 
marker is in the range of from 2 0 to 200 nucleotides. 

A recombinant expression vector according to claim 5 
wherein the number of nucleotides in the space betweeh 
the stop codon of the gene of interest and the start 
codon of the selectable marker is in the range of from 
60 to 8 0 nucleotides. 

A process for producing a cell line in which a gene of 
interest is expressed, which process comprises: 

transforming host cells with an expression vector 
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according to any one of the claims 1 to 5 ; and 

selectable those cells where expression of the 
selection marker gene may be detected. 

8. A process according to claim 7 wherein the host cell 
is a eukaryotic cell. 

9. A host cell transformed with a recombinant expression 
vector according to any one of the claims 1 to 6 . 

10. A retroviral . packaging cell line comprising a host 
cell transformed with a first and a second recombinant 
expression vector, said first recombinant expression 
vector having a packaging-deficient construct 
comprising a viral gag-pol gene and a first selectable 
marker gene downstream thereof, and said second 
recombinant expression vector having a packaging- 
deficient construct comprising a viral env gene and a 
second selectable marker gene downstream thereof; 
wherein the start codon of the first and second 
selectable markers are spaced from the stop codons of 
the viral gag-pol gene and the viral env gene 
respectively by a distance which ensures that said 
selectable marker protein is expressed from the 
corresponding mRNA as a result of translation 
reinitiation . 

11. A retroviral packaging cell line according to claim 10 
wherein the first selectable marker is a bsr 
selectable marker and the second selectable marker is 
a phleo selectable marker. 

12 . A retroviral packaging cell line according to any one 
of claims 10 or 11 wherein the packaging-deficient 
construct comprising the viral gag-pol gene and first 
selectable marker is the CeB (SEQ ID No 2) expression 
construct . 
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13. A retroviral packaging cell line according to any one 
of claims 10 or 11 wherein the packaging- deficient 
construct comprising the viral env gene and second 
selectable marker is the FBdelPASAF (SEQ ID No 5), 
the FBdelPMOSAF (SEQ ID No 6), the FbdelPGASAF (SEQ ID 
No 7) , the FbdelPRDSAF (SEQ ID No 8), the FbdelPXSAF 
(Fig. 3), the FbdelPlOAlSAF (Fig. 3), or the 
FBdelPVSVGSAF (Fig. 3) expression construct. 

14. A retroviral packaging cell line according to any one 
of claims 10 or 11 wherein the recombinant expression 
vector is a packaging-deficient retroviral helper 
construct . 

15 . A retroviral packaging cell line according to claim .14 
wherein the overlapping sequences between the genoiries 
of the retroviral vector and the packaging-deficient 
construct is reduced by minimizing the extent of non- 
coding retroviral sequences in the packaging-deficient 
genome . 

16. A retroviral packaging cell line according to any one 
of claims 10 to 15 wherein the viral gag-pol gene and 
the selectable marker are expressed under the control 
of a non- retroviral promoter. 

17. A retroviral packaging cell line according to claim 16 
wherein the promoter is fused to rabbit beta-l globin 
intron . 



18. A retroviral packaging cell line according to claim 16 
or claim 17 wherein the promoter is a hCMV proraoteir. 

19. A retroviral packaging cell line according to any one 
of claims 16 to claim 18 wherein the viral gag-pol 
gene and the selectable marker is a hCMV+intron (SEQ 

SUBSTITUTE SHEET (RULE 26) 
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ID No3) or a hCMV+intronkaSD (SEQ ID No 4) expression 
construct . 

A retroviral packaging cell line according to anyone 
of claims 10 to 15 wherein the viral env gene and the 
selectable marker are under the control of a non- 
retroviral promoter. 

21, A retroviral packaging cell line according to claim 20 
wherein the promoter is fused to rabbit beta-l globin 
intron. 

22. A retroviral packaging cell line according to claim 20 
or claim 21 wherein the promoter is a hCMV promoter. 

23 . A retroviral packaging cell line according any one of 

claims 20 to 22 wherein the viral env gene and the 
selectable marker is a CMVlOAl (SEQ ID No 9) 
expression construct . 

24 . A retroviral packaging cell line according to any one 

of claims 10 to 23 wherein the cell line is the HT108 0 
line, the TE671 line, the 3T3 line, the 293 line or 
the MV-i-lu line- 

25. A retroviral packaging cell line according to anyone 
of claims 10 to 24 wherein the retroviral packaging 
cells comprises human HT108 0 cells and express RD114 
envelopes. 

26. A retroviral packaging cell line according to anyone 
of claims 10 to 24 wherein the retroviral packaging 
cells comprises human TE671 cells and express RD114 
envelopes . 
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27. A process for producing a retroviral packaging cell 
line in which a gene of interest in expressed, which 
process comprises : 

transforming host cells with a first and a second 
recombinant expression vector, said first recombinant 
expression vector having a packaging-deficient 
construct comprising a viral gag-pol gene and a first 
selectable marker gene downstream thereof, and said 
second recombinant expression vector having a 
packaging-deficient construct comprising a viral env 
gene and a second selectable marker gene downstream 
thereof; wherein the start codon of the first and 
second selectable markers are spaced from the stop 
codons of the viral gag-pol gene and the viral env 
gene respectively by a distance which ensures that 
said selectable marker protein is expressed from the 
corresponding mRNA as a result of translation 
reinitiation; and 

selecting transformed cells which express said 
fi^st and/or second marker genes. 



28 



29 



A packaging deficient construct for use in a process 
according to claim 27, which expresses a viral gag-pol 
gene and a selectable marker wherein a start codon of 
the selectable marker is spaced from a stop codon of 
the viral gag-pol gene by a distance which ensures 
that said selectable marker protein is expressed from 
the corresponding mRNA as a result of translation 
reinitiation. 

A packaging deficient construct for use in a process 
according to claim 27, which expresses a viral env 
gene and a selectable marker gene; wherein a start 
codon of the selectable marker is spaced from a stop 
codon of the viral env gene by a distance which 
ensures that said selectable marker protein is 
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translation reinitiation. 
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po/ gene... i CT AGA CTG ACA 75 G CG'C 
GTT CAA CGC TCT CAA AAC CCC TTA 
AAA ATA AGG TTA ACC CGC GAG GCC 
CCC TAA 

tcccctLaattcttctcatgctcagaggggtcagtac 
tgcttcgcccgccTCcagtgcagcccagccggccacc 
i\7<g AAA ACA TTT AAC ATT TCT,.. bsr gene 



Figure 1. Schematic structure of CeB expression vector 



SUBSTITUTE SHEET (RULE 26) 



BNSDCXJID: <WO 9708330A1J_> 



wo 97/08330 



PCT/GB96/02061 



2/22 



.del psi 









FB29 LTR 
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SD 




ENV 









SA 



PHLEO 



poly A 



env gene...TAC GAG CCA TAG 
QQCQcc tagtgttgacaattaatcatcggcatagtata 

cggcatagtataatacgactcactataggagggccacc 
Ml-G CC AAG TTG ACC.phleo gene 



Figure 2. Schematic structure of FBdelPASF expression vector 
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Figure 3. Schematic structure of env expression vectors 
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NGAGCTC AGGACAGGTAGAAAGAATGAATAGAACAATAAAAGAGACCCTT ACTAAATTGA 6 0 
CCTTAGAGACTGGCTT AAAAGATTGGAGACGCCTCCTATCTCTGGCTTTGTTAAGAGCCA. 120 
G.^AAT ACGCCC AACCGTTTTCGGCTC ACCCC ATATGA^^ATCCTTT ATGGGGGACCCCCCC 180 
CTTTGTCAACCTTGCTCAATTCCTTCTCCCCCTCCGATCCTAAGACTGATTTACAAGCCC 2 40 
GACTAAAAGGGCTGCAAGGCGTGCAGGCCCAAATCTGGACACCCCTGGCCGAATTGTACC 3 00 
GGCCAGG AC ATCC AC AAACTAGCCACCCATTTCAGGTGGGAGACTCCGTGTACGTCCGGC 3 60 
GGCACCGCTCTC.AAGGATTGGAGCCTCGTTGGAAGGGACCTTACATCGTCCTGCTGACCA 4 20 
CGCCCACCGCCATAAAGGTTGACGGGATCGCCGCCTGGATTCACGCATCGCACGCCAAGG 480 
CAGCCCCAA-AAACCCCTGGACCAGAAJVCTCCCAAAACCTGGAAGCTCCGCCGTTCGGAGA 5 40 
ACCCTCTTAAGATAAGACTCTCCCGTGTCTGACTGCTAJi.TCCACCTTGTCCCTGTACTAA 6 00 
CCCAAAATGA-AACTCCCAACAGGAATGGTCATTTTATGTAGCCTAATAATAGTTCGGGCA 6 60 
GGGTTTGACGACCCCCGCAAGGCTATCGCATT AGTACAAAAACAJi.CATGGTAAACCATGC 7 20 
GAATGC AGCGGAGGGCAGGTATCCGAGGCCCCACCGAACTCCATCCAACAGGTAACTTGC 7 3 0 
CCAGGCAAGACGGCCTACTTAATGACCAACCAAAAATGGAAATGCAGAGTCACTCCAAAA 8 40 
ATCTCACCTA.GCGGGGGAGAACTCCAGAACTGCCCCTGTAACACTTTCCAGGACTCGATG 9 00 
CACAGTTCTTGTTATACTGAATACCGGCAATGCAGGCGAATTAJVTAAGACATACTACACG 9 60 
GCCA-CCTTGCTTAAAATACGGTCTGGGAGCCTC AACGAGGTACAGATATT ACAAAACCCC 1020 
AATCAGCTCCTACAGTCCCCTTGTAGGGGCTCTATAAATCAGCCCGTTTGCTGGAGTGCC 1080 
ACAGCCCCCATCCATATCTCCGATGGTGGAGGACCCCTCGATACTAAGAGAGTGTGGACA 1140 
GTCCAAAAAAGGCTAGAACAAATTCATAAGGCTATGACTCCTGAACTTCAATACCACCCC 1200 
TTAGCCCTGCCCAAAGTCAGAGATGACCTTAGCCTTGATGCACGGACTTTTGATATCCTG 12 60 
AATA.CC ACTTTT AGGTTACTCC AGATGTCC AATTTTAGCCTTGCCCA-AGATTGTTGGCTC 1320 
TGTTTAAAACTAGGTACCCCTACCCCTCTTGCGATACCCACTCCCTCTTTAACCTACTCC 1380 
CTAGCAGACTCCCTAGCGAATGCCTCCTGTCAGATTATACCTCCCCTCTTGGTTCAACCG 1440 
ATGCAGTTCTCCAACTCGTCCTGTTTATCTTCCCCTTTCA.TTAA-CGATACGGAACAAATA 1500 
G ACTTAGGTGC AGTCACCTTTACTAACTGC ACCTCTGT AGCCAATGTCAGTAGTCCTTT A 1560 
TGTGCCCTAAACGGGTCAGTCTTCCTCTGTGGAAATAACATGGCATACACCTATTTACCC 1620 
CAAAACTGGACCAGACTTTGCGTCC AAGCCTCCCTCCTCCCCGACATTGACATCAACCCG 1580 
GGGGATG AGCC AGTCCCC ATTCCTGCC ATTG ATC ATTAT ATAC A.TAGACCTAAACGAGCT 1740- 
GTACAGTTCATCCCTTTACTAGCTGGACTGGGAATCACCGCAGCATTCACCACCGGAGCT 1800 
ACAGGCCTAGGTGTCTCCGTCACCC AGTATACAAAATT ATCCCA.TCAGTTAATATCTGAT I860 
GTCC AAGTCTT ATCCGGTACC ATAC AAGATTTACAJVGACC AGGT AGACTCGTT AGCTGAA 1920 
GTAGTTCTCCAAAATAGGAGGGGACTGGACCTACTA;^.CGGCAGAJi.CAAGGAGGAATTTGT 1980 
TT AGCCTTAC AAG AAAAATGCTGTTTTTATGCT AAC AAGTC AGGAATTGTGAGAAACAAA 2040 
ATAAGAACCCTAC AAGAAGAATTAC AAAAACGCAGGGAAAGCCTGGCAACCAACCCTCTC 2100 
TGGACCGGGCTGCAGGGCTTTCTTCCGTACCTCCTACCTCTCCTGGGACCCCTACTCACC 2160 
CTCCTACTCATACTAACCATTGGGCCATGCGTTTTC AGTCGCCTCATGGCCTTCATTAAT 2220 
GATAGACTTAATGTTGTAC ATGCC ATGGTGCTGGCCC AGCAATACC AAGC ACTCAAAGCT 2 280 
GAGGAAGAAGCTCAGGATTGAGCTTCCGGGACAAAAGCAGGGGGGAATGAG AAGTC AGAA 2340 
CCCCCCACCTTTGCTACATAAATAACCGCTTTCATTTCGCTTCTGTAAAACGCTTATGCG 2 400 
CCCCACCCTAGCCGGAAAGTCCCCAGCCGCTACGCAACCCGGGCCCCGAGTTGCATCAGC 2 460 
CGTTCGC AACCCGGGCTCCGAGTTGCATCAGCCGAAA.GAAACTTCATTTCCCAAGCTT 2 518 



Fig. 4 
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Figure 6, CeB Sequence 8/22 1 

AATGAAAGAC CCCACCTGTA GGTTTGGCAA GCTAGCTTAA GTAACGCCAT TTTGCAAGGC - 60 
ATGGAAAAAT ACATAACTGA GAATAGAGAA GTTCAGATCA AGGTCAGGAA CAGATGGAAC 12 0 

AGCTGAATAT GGGCCAAACA GGATATCTGT GGTAAGCAGT TCCTGCCCCG GCTCAGGGCC 180 
AAGAACAGAT GGAACAGCTG AATATGGGCC AAACAGGATA TCTGTGGTAA GCAGTTCCTG 240 
CCCCGGCTCA GGGCCAAGAA CAGATGGTCC CCAGATGCGG TCCAGCCCTC AGCAGTTTCT 300 
AGAGAACCAT CAGATGTTTC CAGGGTGCCC CAAGGACCTG AAATGACCCT GTGCCTTATT 3 60 

TGAACTAJVCC AATCAGTTCG CTTCTCGCTT CTGTTCGCGC GCTTCTGCTC CCCGAGCTCA 420 
ATAAAAGAGC CCACAACCCC TCACTCGGGG CGCCAGTCCT CCGATTGACT GAGTCGCCCG 480 
GGTACCCGTG TATCCAATAA ACCCTCTTGC AGTTGCATCC GACTTGTGGT CTCGCTGTTC 540 
CTTGGGAGGG TCTCCTCTGA GTGATTGACT ACCCGTCAGC GGGGGTCTTT CATTTGGGGG 6 00 

CTCGTCCGGG ATCGGGAGAC CCCTGCCCAG GGACCACCGA CCCACCACCG GGAGGTAAGC 660 
TGGAAGCTTC TGCAGCATCG TTCTGTGTTG TCTCTGTCTG ACTGTGTTTC TGTATTTGTC 72 0 

TGAGAATATG GGCCAGACTG TTACCACTCC CTTAAGTTTG ACCTTAGGTC ACTGGAAAGA 78 0 

TGTCGAGCGG ATCGCTCACA ACCAGTCGGT AGATGTCAAG AAGAGACGTT GGGTTACCTT 840 
CTGCTCTGCA GAATGGCCAA CCTTTAACGT CGGATGGCCG CGAGACGGCA CCTTTAACCG 900 
AGACCTCATC ACCCAGGTTA AGATCAAGGT CTTTTCACCT GGCCCGCATG GACACCCAGA 9 60 

CCAGGTCCCC TACATCGTGA CCTGGGAAGC CTTGGCTTTT GACCCCCCTC CCTGGGTCAA 1020 

GCCCTTTGTA CACCCTAAGC CTCCGCCTCC TCTTCCTCCA TCCGCCCCGT CTCTCCCCCT 1080 

TGAACCTCCT CGTTCGACCC CGCCTCGATC CTCCCTTTAT CCAGCCCTCA CTCCTTCTCT 1140 

AGGCGCCAAA CCTAAACCTC AAGTTCTTTC TGACAGTGGG GGGCCGCTCA TCGACCTACT 12 00 

TACAGAAGAC CCCCCGCCTT ATAGGGACCC AAGACCACCC CCTTCCGACA GGGACGGAAA 12 60 

TGGTGGAGAA GCGACCCCTG CGGGAGAGGC ACCGGACCCC TCCCCAATGG CATCTCGCCT 1320 

ACGTGGGAGA CGGGAGCCCC CTGTGGCCGA CTCCACTACC TCGCAGGCAT TCCCCCTCCG 13 80 

CGCAGGAGGA AACGGACAGC TTCAATACTG GCCGTTCTCC TCTTCTGACC TTTACAACTG 1440 

GAAAAATAAT AACCCTTCTT TTTCTGAAGA TCCAGGTAAA CTGACAGCTC TGATCGAGTC 1500 

TGTTCTCATC ACCCATCAGC CCACCTGGGA CGACTGTCAG CAGCTGTTGG GGACTCTGCT 1560 

GACCGGAGAA GAAAAACAAC GGGTGCTCTT AGAGGCTAGA AAGGCGGTGC GGGGCGATGA 1620 

TGGGCGCCCC ACTCAACTGC CCAATGAAGT CGATGCCGCT TTTCCCCTCG AGCGCCCAGA 1680 

CTGGGATTAC ACCACCCAGG CAGGTAGGAA CCACCTAGTC CACTATCGCC AGTTGCTCCT 1740 

AGCGGGTCTC CAAAACGCGG GCAGAAGCCC CACCAATTTG GCCAAGGTAA AAGGAATAAC 1800 

ACAAGGGCCC AATGAGTCTC CCTCGGCCTT CCTAGAGAGA CTTAAGGAAG CCTATCGCAG 18.60 

GTACACTCCT TATGACCCTG AGGACCCAGG GCAAGAAACT AATGTGTCTA TGTCTTTCAT 1920 

TTGGCAGTCT GCCCCAGACA TTGGGAGAAA GTTAGAGAGG TTAGAAGATT TAAAAAACAA 1980 

GACGCTTGGA GATTTGGTTA GAGAGGCAGA AAAGATCTTT AATAAACGAG AAACCCCGGA 2040 

AGAAAGAGAG GAACGTATCA GGAGAGAAAC AGAGGAAAAA GAAGAACGCC GTAGGACAGA 2100 

GGATGAGCAG AAAGAGAAAG AAAGAGATCG TAGGAGACAT AGAGAGATGA GCAAGCTATT 2160 

GGCCACTGTC GTTAGTGGAC AGAAACAGGA TAGACAGGGA GGAGAACGAA GGAGGTCCCA 2220 

ACTCGATCGC GACCAGTGTG CCTACTGCAA AGAAAAGGGG CACTGGGCTA AAGATTGTCC 2280 

CAAGAAACCA CGAGGACCTC GGGGACCAAG ACCCCAGACC TCCCTCCTGA CCCTAGATGA 23 40 

CTAGGGAGGT CAGGGTCAGG AGCCCCCCCC TGAACCCAGG ATAACCCTCA AAGTCGGGGG 2400 

GCAACCCGTC ACCTTCCTGG TAGATACTGG GGCCCAACAC TCCGTGCTGA CCCAAAATCC 2460 

TGGACCCCTA AGTGATAAGT CTGCCTGGGT CCAAGGGGCT ACTGGAGGAA AGCGGTATCG 2520 

CTGGACCACG GATCGCAAAG TACATCTAGC TACCGGTAAG GTCACCCACT CTTTCCTCCA 2 580 

TGTACCAGAC TGTCCCTATC CTCTGTTAGG AAGAGATTTG CTGACTAAAC TAAAAGCCCA 2 640 

AATCCACTTT GAGGGATCAG GAGCTCAGGT TATGGGACCA ATGGGGCAGC CCCTGCAAGT 2700 

GTTGACCCTA AATATAGAAG ATGAGCATCG GCTACATGAG ACCTCAAAAG AGCCAGATGT 27 60 

TTCTCTAGGG TCCACATGGC TGTCTGATTT TCCTCAGGCC TGGGCGGAAA CCGGGGGCAT 2 82 0 

GGGACTGGCA GTTCGCCAAG CTCCTCTGAT CATACCTCTG AAAGCAACCT CTACCCCCGT 2 880 

GTCCATAAAA CAATACCCCA TGTCACAAGA AGCCAGACTG GGGATCAAGC • CCCACATACA 2 940 

GAGACTGTTG GACCAGGGAA TACTGGTACC CTGCCAGTCC CCCTGGAACA CGCCCCTGCT 3 000 

ACCCGTTAAG AAACCAGGGA CTAATGATTA TAGGCCTGTC CAGGATCTGA GAGAAGTCAA 3 060 

CAAGCGGGTG GAAGACATCC ACCCCACCGT GCCCAACCCT TACAACCTCT TGAGCGGGCT 312 0 ' 

CCCACCGTCC CACCAGTGGT ACACTGTGCT TGATTTAAAG GATGCCTTTT TCTGCCTGAG 3180 

ACTCCACCCC ACCAGTCAGC CTCTCTTCGC CTTTGAGTGG AGAGATCCAG AGATGGGAAT 3 240 

CTCAGGACAA TTGACCTGGA CCAGACTCCC ACAGGGTTTC AAAAACAGTC CCACCCTGTT 3 3 00 

TGATGAGGCA CTGCACAGAG ACCTAGCAGA CTTCCGGATC CAGCACCCAG ACTTGATCCT 33 60 

GCTACAGTAC GTGGATGACT TACTGCTGGC CGCCACTTCT GAGCTAGACT GCCAACAAGG 3 42 0 

TACTCGGGCC CTGTTACAAA CCCTAGGGAA CCTCGGGTAT CGGGCCTCGG CCAAGAAAGC 3 480 

CCAAATTTGC CAGAAACAGG TCAAGTATCT GGGGTATCTT CTAAAAGAGG GTCAGAGATG 3 540 

GCTGACTGAG GCCAGAAAAG AGACTGTGAT GGGGCAGCCT ACTCCGAAGA CCCCTCGACA 3 600 

ACTAAGGGAG TTCCTAGGGA CGGCAGGCTT CTGTCGCCTC TGGATCCCTG GGTTTGCAGA 3 6 60 

AATGGCAGCC CCCTTGTACC CTCTCACCAA AACGGGGACT CTGTTTAATT GGGGCCCAGA 37 20 

CCAACAAAAG GCCTATCAAG AAATCAAGCA AGCTCTTCTA ACTGCCCCAG CCCTGGGGTT 3780 

GCCAGATTTG ACTAAGCCCT TTGAACTCTT TGTCGACGAG AAGCAGGGCT ACGCCAAAGG 3 840 

TGTCCTAACG CAAAAACTGG GACCTTGGCG TCGGCCGGTG GCCTACCTGT CCAAAAAGCT 3 900 

AGACCCAGTA GCAGCTGGGT GGCCCCCTTG CCTACGGATG GTAGCAGCCA TTGCCGTACT 3 960 

GACAAAGGAT GCAGGCAAGC TAACCATGGG ACAGCCACTA GTCATTCTGG CCCCCCATGC 4020 

AGTAGAGGCA CTAGTCAAAC AACCCCCCGA CCGCTGGCTT TCCAACGCCC GGATGACTCA 4080 
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CTATCAGGCC 
CCCGGCTACG 
GGCCGAAGCC 
CACCTGGTAC 
GGTGACCACC 
GCGGGCTGAA 
TGTTTATACT 
AAGGCGTGGG 
CCTACTAAAA 
.^J^GGGACAC 
AGCCATCACA 
CTCAGAACAT 
TTATGATAAA 
TACTTTTGAA 
GGCTCTCCTA 
A.=-ATATCACT 
.\CAGGGAACT 
GATAAAGCCC 
CTGGATAGAA 
AGAGGAGATC 
CTTCGTCTCC 
TTGTGCATAC 
GACTTTAACT 
AGCCCTGTAC 
ATATGGGGCA 
CAGCCCCTCT 
ACCTCTGGCG 
AGTCGGCGAC 
AGGACCTTAC 
TTGGATACAC 
GACATGGCGC 
CTAATCCCCT 
GCGGCCCAGC 
GAAGTAGCGA 
GCAATTCGTA 
CGAGTAACTG 
GATTTTGACA 
CGAGTGGTAA 
TTTGTGTTAA 
CTCAAATATA 
AAAGACAGGA 
CCTATAGAGT 
GGAATGAAAG 
GCATGGAAAA 
ACAGTCGAGA 
AATTTCACAA 
AATGTATCTT 
CTCCTCTACT 
GTTAATTAGG 
AAGGGTAATT 
AAACAGCCCA 
AACACCCTGC 
CACATTTTCC 
AGGAACCCAG 
GTTCATCTGG 
GGTCCTGTAG 
ATGAAAATTT 
TGAATGCAAG 
CCACATCAAA 



TTGCTTTTGG 
CTGCTCCCAC 
CACGGAACCC 
ACGGATGGAA 
GAGACCGAGG 
CTGATAGCAC 
GATAGCCGTT 
TTGCTCACAT 
GCCCTCTTTC 
AGCGCCGAGG 
GAGACTCCAG 
TTTCATTACA 
ACAAAGAAGT 
TTATTAGACT 
GAGAGAAGCC 
GAGACCTGCA 
AGGGTCCGCG 
GGATTGTATG 
GCCTTCCCAA 
TTCCCCAGGT 
AAGGTGAGTC 
AGACCCCAAA 
AAATTAACGC 
CGAGCCCGCA 
CCCCCGCCCC 
CTCCAAGCTC 
GCAGCCTACC 
ACAGTGTGGG 
ACAGTCCTGC 
GCCGCCCACG 
GTTCAACGCT 
TAATTCTTCT 
CGGCCACCAT 
CAGAGAAGAT 
CGAAAACAGG 
TTTGTGCAGA 
CGATTGTAGC 
GTCCTTGTGG 
TAGAAATGAA 
CCCGAAATTA 
TATCAGTGGT 
ACGAGCCATA 
ACCCCACCTG 
ATACATAACT 
ACTTGTTTAT 
ATAAAGCATT 
ATCATGTCTG 
TGAGAGGACA 
TCACTTAACA 
TTAAAATATC 
CAAATGTCAA 
TCATCAAGAA 
CCACCTGTGT 
CACTCCACTG 
TGACTGTCAA 
TTTGCTAACA 
GACCCTTGAA 
TTTAACATAG 
ATATTTCCAC 



ACACGGACCG 
TGCCTGAGGA 
GACCCGACCT 
GCAGTCTCTT 
TAATCTGGGC 
TCACCCAGGC 
ATGCTTTTGC 
CAGAAGGCAA 
TGCCCAAAAG 
CTAGAGGCAA 
ACACCTCTAC 
CAGTGACTGA 
ATTGGGTCTA 
TTCTTCATCA 
ACAGTCCCTA 
AAGCTTGTGC 
GGCATCGGCC 
GCTATAAATA 
CCAAGAAAGA 
TCGGCATGCC 
AGACAGTGGC 
GCTCAGGCCA 
TTGCAACTGG 
ACACGCCGGG 
TTGTAAACTT 
ACTTACAGGC 
AAGAACAACT 
TCCGCCGACA 
TGACCACCCC 
TGAAGGCTGC 
CTCAAAACCC 
GATGCTCAGA 
GAAAACATTT 
TACAATGCTT 
AGAAATCATT 
AGCCATTGCG 
TGTTAGACAC 
TATGTGTAGG 
TGGCAAGTTA 
AAAGTTTTAC 
CCAGGCTCTA 
GATAAAATAA 
TAGGTTTGGC 
GAGAATAGAG 
TGCAGCTTAT 
TTTTTCACTG 
GATCCCCAGG 
TTCCAATCAT 
AAAAGGAAAT 
TGGGAAGTCC 
CAGCAGAAAC 
GCACTGTGGT 
AGGTTCCAAA 
GATAAGCATT 
CTGTAGCATT 
CACCCTGCAG 
TGGGTTTTCC 
CAGTTACCCC 
AGGTTAAGTC 



GGTCCAGTTC 
AGGGCTGCAA 
AACGGACCAG 
ACAAGAGGGA 
TAAAGCCCTG 
CCTAAAGATG 
TACTGCCCAT 
AGAGATCAAA 
ACTTAGCATA 
CCGGATGGCT 
CCTCCTCATA 
TATAAAGGAC 
CCAAGGAAAA 
GCTGACTCAC 
CTACATGCTG 
ACAAGTCAAC 
CGGCACTCAT 
TCTTCTAGTT 
AACCGCCAAG 
TCAGGTATTG 
CGATCTGTTG 
GGTAGAAAGA 
CTCTAGAGAC 
CCCCCATGGC 
CCCTGACCCT 
TCTCTACTTA 
GGACCGACCG 
CCAGACTAAG 
CACCGCCCTC 
CGACCCCGGG 
CTTAAA3\ATA 
GGGGTCAGTA 
AACATTTCTC 
TATGAGGATA 
TCGGCAGTAC 
ATTGGTAGTG 
C CTTATTCTG 
GAGTTGATTT 
GTCAAAACTA 
CACCAAGCTT 
GTTTTGACTC 
AAGATTTTAT 
AAGCTAGCTT 
AAGTTCAGAT 
AATGGTTACA 
CATTCTAGTT 
AAGCTCCTCT 
AGGCTGCCCA 
TGGGTAGGGG 
CTTCCACTGC 
ATACAAGCTG 
TGCTGTGTTA 
ATATCTAGTG 
ATCCTTATCC 
TTTTGGGGTT 
CTCCAAAGGT 
AGCACCATTT 
AATAACCTCA 
CTCATTTAAA 



GGACCGGTGG 
GACAACTGCC 
CCGCTCCCAG 
CAGCGTAAGG 
CCAGCCGGGA 
GCAGAAGGTA 
ATCCATGGAG 
AATAAAGACG 
ATCCATTGTC 
GACCAAGCGG 
GAAAATTCAT 
CTAACCAAGT 
CCTGTGATGC 
CTCAGCTTCT 
AACCGGGATC 
GCCAGCAAGT 
TGGGAGATCG 
TTTATAGATA 
GTCGTAACCA 
GGAACTGACA 
GGGATTGATT 
ATGAATAGAA 
TGGGTGCTCC 
CTCACCCCAT 
GACATGACAA 
GTCCAGCACG 
GTGGTACCTC 
AACCTAGAAC 
AAAGTAGACG 
GGTGGACCAT 
AGGTTAACCC 
CTGCTTCGCC 
AACAAGATCT 
ATAAACATCA 
ATATTGAAGC 
CAGTTTCGAA 
ACGAAGTAGA 
CAGACTATGC 
CGATTGAAGA 
ATCGATTAGT 
AACAATATCA 
TTAGTCTCCA 
AAGTAACGCC 
CAAGGTCAGG 
AATAAAGCAA 
GTGGTTTGTC 
GTGTCCTCAT 
TCCACCCTCT 
TTTTTCACAG 
TGTGTTCCAG 
TCAGCTTTGC 
GTAATGTGCA 
TTTTCATTTT 
AAAACAGCCT 
ACAGTTTGAG 
TCCCCACCAA 
TCATGAGTTT 
GTTTTAACAG 
TTAGGCAAAG 



TAGCCCTGAA 
TTGATATCCT 
ACGCCGACCA 
CGGGAGCTGC 
CATCCGCTCA 
AGAAGCTAAA 
AAATATACAG 
AGATCTTGGC; 
CAGGACATCA 
CCCGAAAGGC 
CACCCTACAC 
TGGGGGCCAT 
CTGACCAGTT 
CAAAAATGAA 
GAACACTCAA 
CTGCCGTTAA 
ATTTCACCGA 
CCTTTTCTGG 
AGAAGCTACT 
ATGGGCCTGC 
GGAAATTACA 
CCATCAAGGA 
TACTCCCCTT 
ATGAGATCTT 
GAGTTACTAA 
AAGTCTGGAG 
ACCCTTACCG 
CTCGCTGGAA 
GCATCGCAGC 
CCTCTAGACT 
GCGAGGCCCC 
CGGCTCCAGT 
AGAATTAGTA 
TGTGGGAGCG 
GTATATAGGA 
TGGACAAAAG 
TAGAAGTATT 
ACCAGATTGT 
ACTCATTCCA 
CCAATTTGTT 
CCAGCTGAAG 
GAAAAAGGGG 
ATTTTGCAAG 
AACAGATGGA 
TAGCATCACA 
CAAACTCATC 
AAACCCTAAC 
GTGTCCTCCT 
ACCGCTTTCT 
AAGTGTTGGT 
ACAAGGGCCC 
AAACAGGAGG 
TACTTGGATC 
TGTGGTCAGT 
CAGGATATTT 
CAGCAAAAAA 
TTTGTGTCCC 
TAACAGCTTC 
GAATTC 



4140 
4200 
4260 
4320 
. 4380 
■ 4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
57.00 
5760 
5820 
5880 
5940 
6000 
6060 
612 0 
6180 
6240 
6300 
63.60 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7616 
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AGATCTCCCG 
AGCCAGTATC 
TAAGCTACAA 
CGTTTTGCGC 
AGTTATTAAT 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGACT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 
GACTTTCCAA 
CGGTGGGAGG 
GCTTATCGAA 
CTTTTTCGCT 
TAGAATGGGA 
CTTTCTACTC 
TTCGTTAAAC 
AGATTGTAAG 
TACTTCAGCA 
GCATATAAAT 
CATCATCCTG 
ATACTCTGAG 
ACAGCTCCTG 
GCAAGCTTCT 
GAGAATATGG 
GTCGAGCGGA 
TGCTCTGCAG 
GACCTCATCA 
CAGGTCCCCT 
CCCTTTGTAC 
GAACCTCCTC 
GGCGCCAAAC 
ACAGAAGACC 
GGTGGAGAAG 
CGTGGGAGAC 
GCAGGAGGAA 
AAAAATAATA 
GTTCTCATCA 
ACCGGAGAAG 
GGGCGCCCCA 
TGGGATTACA 
GCGGGTCTCC 
CAAGGGCCCA 
TACACTCCTT 
TGGCAGTCTG 
ACGCTTGGAG 
GAAAGAGAGG 
GATGAGCAGA 
GCCACTGTCG 
CTCGATCGCG 
AAGAAACCAC 
TAGGGAGGTC 
CAACCCGTCA 
GGACCCCTAA 
TGGACCACGG 
GTACCAGACT 
ATCCACTTTG 
TTGACCCTAA 
TCTCTAGGGT 
GGACTGGCAG 
TCCATAAAAC 
AGACTGTTGG 
CCCGTTAAGA 
AAGCGGGTGG 
CCACCGTCCC 
CTCCACCCCA 



ATCCCCTATG 

TGCTCCCTGC 

CAAGGCAAGG 

TGCTTCGCGA 

AGTAATCAAT 

TTACGGTAAA 

TGACGTATGT 

ATTTACGGTA 

CTATTGACGT 

GGGACTTTCC 

GGTTTTGGCA 

TCCACCCCAT 

AATGTCGTAA 

TCTATATAAG 

ATGTCGACTG 

ATTGTAAAAT 

AGATGTCCCT 

TGTTGACAAC 

TTTAGCTTGC 

TACTTTCTCT 

CAGTTTTAGA 

TCTGGCTGGC 

CCTTTCTCTT 

TCCAAACCGG 

GGCAACGTGC 

GCAGCATCGT 

GCCAGACTGT 

TCGCTCACAA 

AATGGCCAAC 

CCCAGGTTAA 

ACATCGTGAC 

ACCCTAAGCC 

GTTCGACCCC 

CTAAACCTCA 

CCCCGCCTTA 

CGACCCCTGC 

GGGAGCCCCC 

ACGGACAGCT 

ACCCTTCTTT 

CCCATCAGCC 

AAAAACAACG 

CTCAACTGCC 

CCACCCAGGC 

AAAACGCGGG 

ATGAGTCTCC 

ATGACCCTGA 

CCCCAGACAT 

ATTTGGTTAG 

AACGTATCAG 

AAGAGAAAGA 

TTAGTGGACA 

ACCAGTGTGC 

GAGGACCTCG 

AGGGTCAGGA 

CCTTCCTGGT 

GTGATAAGTC 

ATCGCAAAGT 

GTCCCTATCC 

AGGGATCAGG 

ATATAGAAGA 

CCACATGGCT 

TTCGCCAAGC 

AATACCCCAT 

ACCAGGGAAT 

AACCAGGGAC 

AAGACATCCA 

ACCAGTGGTA 

CCAGTCAGCC 



GTCGACTCTC 
TTGTGTGTTG 
CTTGACCGAC 
TGTACGGGCC 
TACGGGGTCA 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTCTC 
AGAACTTCAG 
TCATGTTATA 
TGTATCACCA 
CATTGTCTCC 
ATTTGTAACG 
AATCACTTTT 
GAACAATTGT 
GTGGAAATAT 
TATGGTTACA 
GCCCCTCTGC 
TGGTTGTTGT 
TCTGTGTTGT 
TACCACTCCC 
CCAGTCGGTA 
CTTTAACGTC 
GATCAAGGTC 
CTGGGAAGCC 
TCCGCCTCCT 
GCCTCGATCC 
AGTTCTTTCT 
TAGGGACCCA 
GGGAGAGGCA 
TGTGGCCGAC 
TCAATACTGG 
TTCTGAAGAT 
CACCTGGGAC 
GGTGCTCTTA 
CAATGAAGTC 
AGGTAGGAAC 
CAGAAGCCCC 
CTCGGCCTTC 
GGACCCAGGG 
TGGGAGAAAG 
AGAGGCAGAA 
GAGAGAAACA 
AAGAGATCGT 
GAAACAGGAT 
CTACTGCAAA 
GGGACCAAGA 
GCCCCCCCCT 
AGATACTGGG 
TGCCTGGGTC 
ACATCTAGCT 
TCTGTTAGGA 
AGCTCAGGTT 
TGAGCATCGG 
GTCTGATTTT 
TCCTCTGATC 
GTCACAAGAA 
ACTGGTACCC 
TAATGATTAT 
CCCCACCGTG 
CACTGTGCTT 
TCTCTTCGCC 
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AGTACAATCT 

GAGGTCGCTG 

AATTGCATGA 

AGATATACGC 

TTAGTTCATA 

GGCTGACCGC 

ACGCCAATAG 

TTGGCAGTAC 

AAATGGCCCG 

TACATCTACG 

GGGCGTGGAT 

GGGAGTTTGT 

CCATTGACGC 

TGGCTAACTA 

GGTGAGTTTG 

TGGAGGGGGC 

TGGACCCTCA 

TCTTATTTTC 

AATTTTTAAA 

TTTTCAAGGC 

TATAATTAAA 

TCTTATTGGT 

ATGATATACA 

TAACCATGTT 

GCTGTCTCAT 

CTCTGTCTGA 

TTAAGTTTGA 

GATGTCAAGA 

GGATGGCCGC 

TTTTC ACC TG 

TTGGCTTTTG 

CTTCCTCCAT 

TCCCTTTATC 

GACAGTGGGG 

AGACCACCCC 

CCGGACCCCT 

TCCACTACCT 

CCGTTCTCCT 

CCAGGTAAAC 

GACTGTCAGC 

GAGGCTAGAA 

GATGCCGCTT 

CACCTAGTCC 

ACCAATTTGG 

CTAGAGAGAC 

CAAGAAACTA 

TTAGAGAGGT 

AAGATCTTTA 

GAGGAAAAAG 

AGGAGACATA 

AGACAGGGAG 

GAAAAGGGGC 

CCCCAGACCT 

GAACCCAGGA 

GCCCAACACT 

CAAGGGGCTA 

ACCGGTAAGG 

AGAGATTTGC 

ATGGGACCAA 

CTACATGAGA 

CCTCAGGCCT 

ATACCTCTGA 

GCCAGACTGG 

TGCCAGTCCC 

AGGCCTGTCC 

CCCAACCCTT 

GATTTAAAGG 

TTTGAGTGGA 
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GCTCTGATGC 

AGTAGTGCGC 

AGAATCTGCT 

GTTGACATTG 

GCCCATATAT 

CCAACGACCC 

GGACTTTCCA 

ATCAAGTGTA 

CCTGGCATTA 

TATTAGTCAT 

AGCGGTTTGA 

TTTGGCACCA 

AAATGGGCGG 

GAGAACCCAC 

GGGACCCTTG 

AAAGTTTTCA 

TGATAATTTT 

TTTTCATTTT 

TTCACTTTTG 

AATCAGGGTA 

TGATAAGGTA 

AGAAACAACT 

CTGTTTGAGA 

CATGCCTTCT 

CATTTTGGCA 

CTGTGTTTCT 

CCTTAGGTCA 

AGAGACGTTG 

GAGACGGCAC 

GCCCGCATGG 

ACCCCCCTCC 

CCGCCCCGTC 

CAGCCCTCAC 

GGCCGCTCAT 

CTTCCGACAG 

CCCCAATGGC 

CGCAGGCATT 

CTTCTGACCT 

TGACAGCTCT 

AGCTGTTGGG 

AGGCGGTGCG 

TTCCCCTCGA 

ACTATCGCCA 

CCAAGGTAAA 

TTAAGGAAGC 

ATGTGTCTAT 

TAGAAGATTT 

ATAAACGAGA 

AAGAACGCCG ' 

GAGAGATGAG 

GAGAACGAAG 

ACTGGGCTAA 

CCCTCCTGAC 

TAACCCTCAA 

CCGTGCTGAC 

CTGGAGGAAA 

TCACCCACTC 

TGACTAAACT 

TGGGGCAGCC- 

CCTCAAAAGA 

GGGCGGAAAC 

AAGCAACCTC 

GGATCAAGCC 

CCTGGAACAC 

AGGATCTGAG 

ACAACCTCTT 

ATGCCTTTTT 

GAGATCCAGA 



CGCATAGTTA 

GAGCAAAATT 

TAGGGTTAGG 

ATTATTGACT 

GGAGTTCCGC 

CCGCCCATTG 

TTGACGTCAA 

TCATATGCCA 

TGCCCAGTAC 

CGCTATTACC 

CTCACGGGGA 

AAATCAACGG 

TAGGCGTGTA 

TGCTTAACTG 

ATTGTTCTTT 

GGGTGTTGTT 

GTTTCTTTCA 

CTGTAACTTT 

TTTATTTGTC 

TATTATATTG 

GAATATTTCT 

ACATCCTGGT 

TGAGGATAAA 

TCTTTTTCCT 

AGAATTGGCC 

GTATTTGTCT 

CTGGAAAGAT 

GGTTACCTTC 

CTTTAACCGA 

ACACCCAGAC 

CTGGGTCAAG 

TCTCCCCCTT 

TCCTTCTCTA 

CGACCTACTT 

GGACGGAAAT 

ATCTCGCCTA 

CCCCCTCCGC 

TTACAACTGG 

GATCGAGTCT 

GACTCTGCTG 

GGGCGATGAT 

GCGCCCAGAC 

GTTGCTCCTA 

AGGAATAACA 

CTATCGCAGG 

GTCTTTCATT 

AAAAAACAAG 

AACCCCGGAA 

TAGGACAGAG 

CAAGCTATTG 

GAGGTCCCAA 

AGATTGTCCC 

CCTAGATGAC 

AGTCGGGGGG 

CCAAAATCCT 

GCGGTATCGC 

TTTCCTCCAT 

AAAAGCCCAA 

CCTGCAAGTG 

GCCAGATGTT 

CGGGGGCATG 

TACCCCCGTG 

CCACATACAG 

GCCCCTGCTA 

AGAAGTCAAC 

GAGCGGGCTC 

CTGCCTGAGA 

GATGGGAATC 
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600 
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900 
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4080 
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Figure 7. hCMV+intron Sequence 



TCAGGACAAT 
GATGAGGCAC 
CTACAGTACG 
ACTCGGGCCC 
CAAATTTGCC 
CTGACTGAGG 
CTAAGGGAGT 
ATGGCAGCCC 
CAACAAAAGG 
CCAGATTTGA 
GTCCTAACGC 
GACCCAGTAG 
ACAAAGGATG 
GTAGAGGCAC 
TATCAGGCCT 
CCGGCTACGC 
GCCGAAGCCC 
ACCTGGTACA 
GTGACCACCG 
CGGGCTGAAC 
GTTTATACTG 
AGGCGTGGGT 
CTACTAAAAG 
AJ^.GGGACACA 
GCCATCACAG 
TCAGAACATT 
TATGATAAAA 
ACTTTTGAAT 
GCTCTCCTAG 
AATATCACTG 
CAGGGAACTA 
ATAAAGCCCG 
TGGATAGAAG 
GAGGAGATCT 
TTCGTCTCCA 
TGTGCATACA 
ACTTTAACTA 
GCCCTGTACC 
TATGGGGCAC 
AGCCCCTCTC 
CCTCTGGCGG 
GTCGGCGACA 
GGACCTTACA 
TGGATACACG 
ACATGGCGCG 
TAATCCCCTT 
CGGCCCAGCC 
AAGTAGCGAC 
CAATTCGTAC 
GAGTAACTGT 
ATTTTGACAC 
GAGTGGTAAG 
TTGTGTTAAT 
TCAAATATAC 



TGACCTGGAC 
TGCACAGAGA 
TGGATGACTT 
TGTTACAAAC 
AGAAACAGGT 
CCAGAAAAGA 
TCCTAGGGAC 
CCTTGTACCC 
CCTATCAAGA 
CTAAGCCCTT 
AAAAACTGGG 
CAGCTGGGTG 
CAGGCAAGCT 
TAGTCAAACA 
TGCTTTTGGA 
TGCTCCCACT 
ACGGAACCCG 
CGGATGGAAG 
AGACCGAGGT 
TGATAGCACT 
ATAGCCGTTA 
TGCTCACATC 
CCCTCTTTCT 
GCGCCGAGGC 
AGACTCCAGA 
TTCATTACAC 
CAAAGAAGTA 
TATTAGACTT 
AGAGAAGCCA 
AGACCTGCAA 
GGGTCCGCGG 
GATTGTATGG 
CCTTCCCAAC 
TCCCCAGGTT 
AGGTGAGTCA 
GACCCCAAAG 
AATTAACGCT 
GAGCCCGCAA 
CCCCGCCCCT 
TCCAAGCTCA 
CAGCCTACCA 
CAGTGTGGGT 
CAGTCCTGCT 
CCGCCCACGT 
TTCAACGCTC 
AATTCTTCTG 
GGCCACCATG 
AGAGAAGATT 
GAAAACAGGA 
TTGTGCAGAA 
GATTGTAGCT 
TCCTTGTGGT 
AGAAATGAAT 
CCGAAATTAA 



CAGACTCCCA 
CCTAGCAGAC 
ACTGCTGGCC 
CCTAGGGAAC 
CAAGTATCTG 
GACTGTGATG 
GGCAGGCTTC 
TCTCACCAAA 
AATCAAGCAA 
TGAACTCTTT 
ACCTTGGCGT 
GCCCCCTTGC 
AACCATGGGA 
ACCCCCCGAC 
CACGGACCGG 
GCCTGAGGAA 
ACCCGACCTA 
CAGTCTCTTA 
AATCTGGGCT 
CACCCAGGCC 
TGCTTTTGCT 
AGAAGGCAAA 
GCCCAAAAGA 
TAGAGGCAAC 
CACCTCTACC 
AGTGACTGAT 
TTGGGTCTAC 
TCTTCATCAG 
CAGTCCCTAC 
AGCTTGTGCA 
GCATCGGCCC 
CTATAAATAT 
CAAGAAAGAA 
CGGCATGCCT 
GACAGTGGCC 
CTCAGGCCAG 
TGCAACTGGC 
CACGCCGGGC 
TGTAAACTTC 
CTTTACAGGCT 
AGAACAACTG 
CCGCCGACAC 
GACCACCCCC 
GAAGGCTGCC 
TCAAAACCCC 
ATGCTCAGAG 
AAAACATTTA 
ACAATGCTTT 
GAAATCATTT 
GCCATTGCGA 
GTTAGACACC 
ATGTGTAGGG 
GGCAAGTTAG 
AAGTTTTACC 
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CAGGGTTTCA 
TTCCGGATCC 
GCCACTTCTG 
CTCGGGTATC 
GGGTATCTTC 
GGGCAGCCTA 
TGTCGCCTCT 
ACGGGGACTC 
GCTCTTCTAA 
GTCGACGAGA 
CGGCCGGTGG 
CTACGGATGG 
CAGCCACTAG 
CGCTGGCTTT 
GTCCAGTTCG 
GGGCTGCAAC 
ACGGACCAGC 
CAAGAGGGAC 
AAAGCCCTGC 
CTAAAGATGG 
ACTGCCCATA 
GAGATCAAAA 
CTTAGCATAA 
CGGATGGCTG 
CTCCTCATAG 
ATAAAGGACC 
CAAGGAAAAC 
CTGACTCACC 
TACATGCTGA 
CAAGTCAACG 
GGCACTCATT 
CTTCTAGTTT 
ACCGCCAAGG 
CAGGTATTGG 
GATCTGTTGG 
GTAGAAAGAA 
TCTAGAGACT 
CCCCATGGCC 
CCTGACCCTG 
CTCTACTTAG 
GACCGACCGG 
CAGACTAAGA 
ACCGCCGTCA 
GACCCCGGGG 
TTAAAAATAA 
GGGTCAGTAC 
ACATTTCTCA 
ATGAGGATAA 
CGGCAGTACA 
TTGGTAGTGC 
CTTATTCTGA 
AGTTGATTTC 
TCAAAACTAC 
ACCAAGCTTA 



AAAACAGTCC 
AGCACCCAGA 
AGCTAGACTG 
GGGCCTCGGC 
TAAAAGAGGG 
CTCCGAAGAC 
GGATCCCTGG 
TGTTTAATTG 
CTGCCCCAGC 
AGCAGGGCTA 
CCTACCTGTC 
TAGCAGCCAT 
TCATTCTGGC 
CCAACGCCCG 
GACCGGTGGT 
ACAACTGCCT 
CGCTCCCAGA 
AGCGTAAGGC 
CAGCCGGGAC 
CAGAAGGTAA 
TCCATGGAGA 
ATAAAGACGA 
fCCATTGTCC 
ACCAAGCGGC 
AAAATTCATC 
TAACCAAGTT 
CTGTGATGCC 
TCAGCTTCTC 
ACCGGGATCG 
CCAGCAAGTC 
GGGAGATCGA 
TTATAGATAC 
TCGTAACCAA 
GAACTGACAA 
GGATTGATTG 
TGAATAGAAC 
GGGTGCTCCT 
TCACCCGATA 
ACATGACAAG 
TCCAGCACGA 
TGGTACCTCA 
ACCTAGAACC 
AAGTAGACGG 
GTGGACCATC 
GGTTAACCCG 
TGCTTCGCCC 
ACAAGATCTA 
TAAACATCAT 
TATTGAAGCG 
AGTTTCGAAT 
CGAAGTAGAT 
AGACTATGCA 
GATTGAAGAA 
TCGAATTC 



CACCCTGTTT 
CTTGATCCTG 
CCAACAAGGT 
CAAGAAAGCC 
TCAGAGATGG 
CCCTCGACAA 
GTTTGCAGAA 
GGGCCCAGAC 
CCTGGGGTTG 
CGCCAAAGGT 
CAAAAAGCTA 
TGCCGTACTG 
CCCCCATGCA . 
GATGACTCAC 
AGCCCTGAAC 
TGATATCCTG 
CGCCGACCAC 
GGGAGCTGCG 
ATCCGCTCAG 
GAAGCTAAAT 
AATATACAGA 
GATCTTGGCC 
AGGACATCAA 
CCGAAAGGCA 
ACCCTACACC 
GGGGGCCATT 
TGACCAGTTT 
AAAAATGAAG 
AACACTCAAA 
TGCCGTTAAA 
TTTCACCGAG 
CTTTTCTGGC 
GAAGCTACTA 
TGGGCCTGCC 
GAAATTACAT 
CATCAAGGAG 
ACTCCCCTTA 
TGAGATCTTA 
AGTTACTAAC 
AGTCTGGAGA 
CCCTTACCGA 
TCGCTGGAAA 
CATCGCAGCT 
CTCTAGACTG 
CGAGGCCCCC 
GGCTCCAGTG 
GAATTAGTAG 
GTGGGAGCGG 
TATATAGGAC 
GGACAAAAGG 
AGAAGTATTC 
CCAGATTGTT 
CTCATTCCAC 
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AGATCTCCCG ATCCCCTATG GTCGACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA 
AGCCAGTATC TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGCAAAATT 
TAAGCTACAA CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT TAGGGTTAGG 
CGTTTTGCGC TGCTTCGCGA TGTACGGGCC AGATATACGC GTTGACATTG ATTATTGACT 
AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 
GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 
ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 
TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 
ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 
ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 
TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 
GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 
CGGTGGGAGG TCTATATAAG CAGAGCTCTC TGGCTAACTA GAGAACCCAC TGCTTAACTG 
GCTTATCGAA ATGTCGACTG AGAACTTCAG GGTGAGTTTG GGGACCCTTG ATTGTTCTTT 
CTTTTTCGCT ATTGTAAAAT TCATGTTATA TGGAGGGGGC AAAGTTTTCA GGGTGTTGTT 
TAGAATGGGA AGATGTCCCT TGTATCACCA TGGACCCTCA TGATAATTTT GTTTCTTTCA 
CTTTCTACTC TGTTGACAAC CATTGTCTCC TCTTATTTTC TTTTCATTTT CTGTAACTTT 
TTCGTTAAAC TTTAGCTTGC ATTTGTAACG AATTTTTAAA TTCACTTTTG TTTATTTGTC 
AGATTGTAAG TACTTTCTCT AATCACTTTT TTTTCAAGGC AATCAGGGTA TATTATATTG 
TACTTCAGCA CAGTTTTAGA GAACAATTGT TATAATTAAJv TGATAAGGTA GAATATTTCT 
GCATATAAAT TCTGGCTGGC GTGGAAATAT TCTTATTGGT AGAAACAACT ACATCCTGGT 
CATCATCCTG CCTTTCTCTT TATGGTTACA ATGATATACA CTGTTTGAGA TGAGGATAAA 
ATACTCTGAG TCCAAACCGG GCCCCTCTGC TAACCATGTT CATGCCTTCT TCTTTTTCCT 
ACAGCTCCTG GGCAACGTGC TGGTTGTTGT GCTGTCTCAT CATTTTGGCA AGAATTGGCC 
GCAAGCTTCT GCAGCATCGT TCTGTGTTGT CTCTGTCTGA CTGTGTTTCT GTATTTGTCT 
GAGAATATGG GCCAGACTGT TACCACTCCC TTAAGTTTGA CCTTAGGTCA CTGGAAAGAT 
GTCGAGCGGA TCGCTCACAA CCAGTCGGTA GATGTCAAGA AGAGACGTTG GGTTACCTTC 
TGCTCTGCAG AATGGCCAAC CTTTAACGTC GGATGGCCGC GAGACGGCAC CTTTAACCGA 
GACCTCATCA CCCAGGTTAA GATCAAGGTC TTTTCACCTG GCCCGCATGG ACACCCAGAC 
CAGGTCCCCT ACATCGTGAC CTGGGAAGCC TTGGCTTTTG ACCCCCCTCC CTGGGTCAAG 
CCCTTTGTAC ACCCTAAGCC TCCGCCTCCT CTTCCTCCAT CCGCCCCGTC TCTCCCCCTT 
GAACCTCCTC GTTCGACCCC GCCTCGATCC TCCCTTTATC CAGCCCTCAC TCCTTCTCTA 
GGCGCCAAAC CTAAACCTCA AGTTCTTTCT GACAGTGGGG GGCCGCTCAT CGACCTACTT 
ACAGAAGACC CCCCGCCTTA TAGGGACCCA AGACCACCCC CTTCCGACAG GGACGGAAAT 
GGTGGAGAAG CGACCCCTGC GGGAGAGGCA CCGGACCCCT CCCCAATGGC ATCTCGCCTA 
CGTGGGAGAC GGGAGCCCCC TGTGGCCGAC TCCACTACCT CGCAGGCATT CCCCCTCCGC 
GCAGGAGGAA ACGGACAGCT TCAATACTGG CCGTTCTCCT CTTCTGACCT TTACAACTGG 
AAAAATAATA ACCCTTCTTT TTCTGAAGAT CCAGGTAAAC TGACAGCTCT GATCGAGTCT 
GTTCTCATCA CCCATCAGCC GACCTGGGAC GACTGTCAGC AGCTGTTGGG GACTCTGCTG 
ACCGGAGAAG AAAAACAACG GGTGCTCTTA GAGGCTAGAA AGGCGGTGCG GGGCGATGAT 
GGGCGCCCCA CTCAACTGCC CAATGAAGTC GATGCCGCTT TTCCCCTCGA GCGCCCAGAC 
TGGGATTACA CCACCCAGGC AGGACGCAAC CACCTAGTCC ACTATCGCCA GTTGCTCCTA 
GCGGGTCTCC AAAACGCGGG CAGAAGCCCC ACCAATTTGG CCAAGGTAAA AGGAATAACA 
CAAGGGCCCA ATGAGTCTCC CTCGGCCTTC CTAGAGAGAC TTAAGGAAGC CTATCGCAGG 
TACACTCCTT ATGACCCTGA GGACCCAGGG CAAGAAACTA ATGTGTCTAT GTCTTTCATT 
TGGCAGTCTG CCCCAGACAT TGGGAGAAAG TTAGAGAGGT TAGAAGATTT AAAAAACAAG 
ACGCTTGGAG ATTTGGTTAG AGAGGCAGAA AAGATCTTTA ATAAACGAGA AACCCCGGAA 
GAAAGAGAGG AACGTATCAG GAGAGAAACA GAGGAAAAAG AAGAACGCCG . TAGGACAGAG 
GATGAGCAGA AAGAGAAAGA AAGAGATCGT AGGAGACATA GAGAGATGAG CAAGCTATTG 
GCCACTGTCG TTAGTGGACA GAAACAGGAT AGACAGGGAG GAGAACGAAG GAGGTCCCAA 
CTCGATCGCG ACCAGTGTGC CTACTGCAAA GAAAAGGGGC ACTGGGCTAA AGATTGTCCC 
AAGAAACCAC GAGGACCTCG GGGACCAAGA CCCCAGACCT CCCTCCTGAC CCTAGATGAC 
TAGGGAGGTC AGGGTCAGGA GCCCCCCCCT GAACCCAGGA TAACCCTCAA AGTCGGGGGG 
CAACCCGTCA CCTTCCTGGT AGATACTGGG GCCCAACACT CCGTGCTGAC CCAAAATCCT 
GGACCCCTAA GTGATAAGTC TGCCTGGGTC CAAGGGGCTA CTGGAGGAAA GCGGTATCGC 
TGGACCACGG ATCGCAAAGT ACATCTAGCT ACCGGTAAGG TCACCCACTC TTTCCTCCAT 
GTACCAGACT GTCCCTATCC TCTGTTAGGA AGAGATTTGC TGACTAAACT AAAAGCCCAA 
ATCCACTTTG AGGGATCAGG AGCTCAGGTT ATGGGACCAA TGGGGCAGCC CCTGCAAGTG 
TTGACCCTAA ATATAGAAGA TGAGCATCGG CTACATGAGA CCTCAAAAGA GCCAGATGTT 
TCTCTAGGGT CCACATGGCT GTCTGATTTT CCTCAGGCCT GGGCGGAAAC CGGGGGCATG 
GGACTGGCAG TTCGCCAAGC TCCTCTGATC ATACCTCTGA AAGCAACCTC TACCCCCGTG 
TCCATAAAAC AATACCCCAT GTCACAAGAA GCCAGACTGG GGATCAAGCC CCACATACAG 
AGACTGTTGG ACCAGGGAAT ACTGGTACCC TGCCAGTCCC CCTGGAACAC GCCCCTGCTA 
CCCGTTAAGA AACCAGGGAC TAATGATTAT AGGCCTGTCC AGGATCTGAG AGAAGTCAAC 
AAGCGGGTGG AAGACATCCA CCCCACCGTG CCCAACCCTT ACAACCTCTT GAGCGGGCTC 
CCACCGTCCC ACCAGTGGTA CACTGTGCTT GATTTAAAGG ATGCCTTTTT CTGCCTGAGA 
CTCCACCCCA CCAGTCAGCC TCTCTTCGCC TTTGAGTGGA GAGATCCAGA GATGGGAATC 
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TCAGGACAAT 

GATGAGGCAC 

CTACAGTACG 

ACTCGGGCCC 

CAAATTTGCC 

CTGACTGAGG 

CTAAGGGAGT 

ATGGCAGCCC 

CAACAAAAGG 

CCAGATTTGA 

GTCCTAACGC 

GACCCAGTAG 

ACAAAGGATG 

GTAGAGGCAC 

TATCAGGCCT 

CCGGCTACGC 

GCCGAAGCCC 

ACCTGGTACA 

GTGACCACCG 

CGGGCTGAAC 

GTTTATACTG 

AGGCGTGGGT 

CTACTAAAAG 

AAGGGACACA 

GCCATCACAG 

TCAGAACATT 

TATGATAAAA 

ACTTTTGAAT 

GCTCTCCTAG 

AATATCACTG 

CAGGGAACTA 

ATAAAGCCCG 

TGGATAGAAG 

GAGGAGATCT 

TTCGTCTCCA 

TGTGCATACA 

ACTTTAACTA 

GCCCTGTACC 

TATGGGGCAC 

AGCCCCTCTC 

CCTCTGGCGG 

GTCGGCGACA 

GGACCTTACA 

TGGATACACG 

ACATGGCGCG 

TAATCCCCTT 

CGGCCCAGCC 

AAGTAGCGAC 

CAATTCGTAC 

GAGTAACTGT 

ATTTTGACAC 

GAGTGGTAAG 

TTGTGTTAAT 

TCAAATATAC 



TGACCTGGAC 
TGCACAGAGA 
TGGATGACTT 
TGTTACAAAC 
AGAAACAGGT 
CCAGAAAAGA 
TCCTAGGGAC 
CCTTGTACCC 
CCTATCA?^GA 
CTAAGCCCTT 
AAAAACTGGG 
CAGCTGGGTG 
CAGGCAAGCT 
TAGTCAAACA 
TGCTTTTGGA 
TGCTCCCACT 
ACGGAACCCG 
CGGATGGAAG 
AGACCGAGGT 
TGATAGCACT 

"atagccgtta 
tgctcacatc 

CCCTCTTTCT 
GCGCCGAGGC 
AGACTCCAGA 
TTCATTACAC 
CAAAGAAGTA 
TATTAGACTT 
AGAGAAGCCA 
AGACCTGCAA 
GGGTCCGCGG 
GATTGTATGG 
CCTTCCCAAC 
TCCCCAGGTT 
AGGTGAGTCA 
GACCCCAAAG 
AATTAACGCT 
GAGCCCGCAA 
CCCCGCCCCT 
TCCAAGCTCA 
CAGCCTACCA 
CAGTGTGGGT 
CAGTCCTGCT 
CCGCCCACGT 
TTCAACGCTC 
AATTCTTCTG 
GGCCACCATG 
AGAGAAGATT 
GAAAACAGGA 
TTGTGCAGAA 
GATTGTAGCT 
TCCTTGTGGT 
AGAAATGAAT 
CCGAAATTAA 



CAGACTCCCA 

CCTAGCAGAC 

ACTGCTGGCC 

CCTAGGGAAC 

CAAGTATCTG 

GACTGTGATG 

GGCAGGCTTC 

TCTCACCAAA 

AATCAAGCAA 

TGAACTCTTT 

ACCTTGGCGT 

GCCCCCTTGC 

AACCATGGGA 

ACCCCCCGAC 

CACGGACCGG 

GCCTGAGGAA 

ACCCGACCTA 

CAGTCTCTTA 

AATCTGGGCT 

CACCCAGGCC 

TGCTTTTGCT 

AGAAGGCAAA 

GCCCAAAAGA 

TAGAGGCAAC 

CACCTCTACC 

AGTGACTGAT 

TTGGGTCTAC 

TCTTCATCAG 

CAGTCCCTAC 

AGCTTGTGCA 

GCATCGGCCC 

CTATAAATAT 

CAAGAAAGAA 

CGGCATGCCT 

GACAGTGGCC 

CTCAGGCCAG 

TGCAACTGGC 

CACGCCGGGC 

TGTAAACTTC 

CTTACAGGCT 

AGAACAACTG 

CCGCCGACAC 

GACCACCCCC 

GAAGGCTGCC 

TCAAAACCCC 

ATGCTCAGAG 

AAAACATTTA 

ACAATGCTTT 

GAAATCATTT 

GCCATTGCGA 

GTTAGACACC 

ATGTGTAGGG 

GGCAAGTTAG 

AAGTTTTACC 



CAGGGTTTCA 

TTCCGGATCC 

GCCACTTCTG 

CTCGGGTATC 

GGGTATCTTC 

GGGCAGCCTA 

TGTCGCCTCT 

ACGGGGACTC 

GCTCTTCTAA 

GTCGACGAGA 

CGGCCGGTGG 

CTACGGATGG 

CAGCCACTAG 

CGCTGGCTTT 

GTCCAGTTCG 

GGGCTGCAAC 

ACGGACCAGC 

CAAGAGGGAC 

AAAGCCCTGC 

CTAAAGATGG 

ACTGCCCATA 

GAGATCAAAA 

CTTAGCATAA 

CGGATGGCTG 

CTCCTCATAG 

ATAAAGGACC 

CAAGGAAAAC 

CTGACTCACC 

TACATGCTGA 

CAAGTCAACG 

GGCACTCATT 

CTTCTAGTTT 

ACCGCCAAGG 

CAGGTATTGG 

GATCTGTTGG 

GTAGAAAGAA 

TCTAGAGACT 

CCCCATGGCC 

CCTGACCCTG 

CTCTACTTAG 

GACCGACCGG 

CAGACTAAGA 

ACCGCCCTCA 

GACCCCGGGG 

TTAAAAATAA 

GGGTCAGTAC 

ACATTTCTCA 

ATGAGGATAA 

CGGCAGTACA 

TTGGTAGTGC 

CTTATTCTGA 

AGTTGATTTC 

TCAAAACTAC 

ACCAAGCTTA 



AAAACAGTCC 
AGCACCCAGA 
AGCTAGACTG 
GGGCCTCGGC 
TAAAAGAGGG 
CTCCGAAGAC 
GGATCCCTGG 
TGTTTAATTG 
CTGCCCCAGC 
AGCAGGGCTA 
CCTACCTGTC 
TAGCAGCCAT 
TCATTCTGGC 
CCAACGCCCG 
GACCGGTGGT 
ACAACTGCCT 
CGCTCCCAGA 
AGCGTAAGGC 
CAGCCGGGAC 
CAGAAGGTAA 
TCCATGGAGA 
ATAAAGACGA 
TCCATTGTCC 
ACCAAGCGGC 
AAAATTCATC 
TAACCAAGTT 
CTGTGATGCC 
TCAGCTTCTC 
ACCGGGATCG 
CCAGCAAGTC 
GGGAGATCGA 
TTATAGATAC 
TCGTAACCAA 
GAACTGACAA 
GGATTGATTG 
TGAATAGAAC 
GGGTGCTCCT 
TCACCCCATA 
ACATGACAAG 
TCCAGCACGA 
TGGTACCTCA 
ACCTAGAACC 
AAGTAGACGG 
GTGGACCATC 
GGTTAACCCG 
TGCTTCGCCC 
ACAAGATCTA 
TAAACATCAT 
TATTGAAGCG 
AGTTTCGAAT 
CGAAGTAGAT 
AGACTATGCA 
GATTGAAGAA 
TCGAATTC 



CACCCTGTTT 
CTTGATCCTG 
CCAACAAGGT 
CAAGAAAGCC 
TCAGAGATGG 
CCCTCGACAA 
GTTTGCAGAA 
GGGCCCAGAC 
CCTGGGGTTG 
CGCCAAAGGT 
CAlAAAAGCTA 
TGCCGTACTG 
CCCCCATGCA 
GATGACTCAC 
AGCCCTGAAC 
TGATATCCTG 
CGCCGACCAC 
GGGAGCTGCG 
ATCCGCTCAG 
GAAGCTAAAT 
AATATACAGA 
GATCTTGGCC 
AGGACATCAA 
CCGAAAGGCA 
ACCCTACACC 
GGGGGCCATT 
TGACCAGTTT 
AAAAATGAAG 
AACACTCAAA 
TGGCGTTAAA, 
TTTCACCGAG 
CTTTTCTGGC 
GAAGCTACTA 
TGGGCCTGCC 
GAAATTACAT 
CATCAAGGAG 
ACTCCCCTTA 
TGAGATCTTA 
AGTTACTAAC 
AGTCTGGAGA 
CCCTTACCGA 
TCGCTGGAAA 
CATCGCAGCT 
CTCTAGACTG 
CGAGGCCCCC 
GGCTCCAGTG 
GAATTAGTAG 
GTGGGAGCGG 
TATATAGGAC 
GGACAAAAGG 
AGAAGTATTC 
CCAGATTGTT 
CTCATTCCAC 
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CATATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA AATACCGCAT CAGGCGCCAT ou 
TCGCCATTCA GGCTGCGCAA CTGTTGGGAA GGGCGATCGG TGCGGGCCTC TTCGCTATTA ' 12 0 

CGCCAGCTGG CGAAAGGGGG ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAGGGTTT ISO 

TCCCAGTCAC GACGTTGTAA AACGACGGCC AGTGAATTCC GATTAGTTCA ATTTGTTAAA 2 40 

GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC AATACCACCA GCTAAAACCA 300 

CTAGAATACG AGCCACAATA AATAAAAGAT TTTATTTAGT TTCCAGAAAA AGGGGGGAAT 3 50 

GAAAGACCCC ACCAAATTGC TTAGCCTGAT AGCCGCAGTA ACGCCATTTT GCAAGGCATG 420 

GAAAAATACC AAACCAAGAA TAGAGAAGTT CAGATCAAGG GCGGGTACAC GAAAACAGCT 480 

AACGTTGGGC CAAACAGGAT ATCTGCGGTG AGCAGTTTCG GCCCCGGCCC GGGGCCAAGA 540 

ACAGATGGTC ACCGCGGTTC GGCCCCGGCC CGGGGCCAAG AACAGATGGT CCCCAGATAT 600 

GGCCCAACCC TCAGCAGTTT CTTAAGACCC ATCAGATGTT TCCAGGCTCC CCCAAGGACC 660 

TGAAATGACC CTGTGCCTTA TTTGAATTAA CCAATCAGCC TGCTTCTCGC TTCTGTTCGC 72 0 

GCGCTTCTGC TTCCCGAGCT CTATAAAAGA GCTCACAACC CCTCACTCGG CGCGCCAGTC 780 

CTCCGATAGA CTGAGTCGCC CGGGTACCCG TGTATCCAAT AAATCCTCTT GCTGTTGCAT 840 

CCGACTCGTG GTCTCGCTGT TCCTTGGGAG GGTCTCCTCA GAGTGATTGA CTACCCGTCT 900 

CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCTGGAGA CCCCTGCCCA GGGACCACCG 96 0 

ACCCACCACC GGGAGGTAAG CTGGCCAAGA TCTTATATGG GGCACCCCCG CCCCTTGTAA 102 0 

ACTTCCCTGA CCCTGACATG ACCAGAGTTA CTAACAGCCC CTCTCTCCAA GCTCACTTAC 1080 

AGGCTCTCTA CTTAGTCCAG CACGAAGTTT GGAGACCACT GGCGGCAGCT TACCAAGAAC 1140 

AACTGGACCG GCCGGTGGTG CCTCACCCTT ACCGGGTCGG CGACACAGTG TGGGTCCGCC 1200 

GACATCAAAC CAAGAACCTA GAACCTCGCT GGAAAGGACC TTACACAGTC CTGCTGACCA 12 60 

CCCCCACCGC CCTCAAAGTA * GACGGTATCG CAGCTTGGAT ACACGCAGCC CACGTAAAGG 13 2 0 

CGGCCGACAC CGAGAGTGGA CCATCCTCTG GACGGACATG GCGCGTTCAA CGCTCTCAAA 13 80 

ACCCCCTCAA GATAAGATTA ACCCGTGGAA GCCCTTAATA GTCATGGGAG TCCTGTTAGG 1440 

AGTAGGGATG GCAGAGAGCC CCCATCAGGT CTTTAATGTA ACCTGGAGAG TCACCAACCT 150 0 

GATGACTGGG CGTACCGCCA ATGCCACCTC CCTCCTGGGA ACTGTACAAG ATGCCTTCCC 1560 

AAAATTATAT TTTGATCTAT GTGATCTGGT CGGAGAGGAG TGGGACCCTT CAGACCAGGA 162 0 

ACCGTATGTC GGGTATGGCT GCAAGTACCC CGCAGGGAGA CAGCGGACCC GGACTTTTGA 168 0 

CTTTTACGTG TGCCCTGGGC ATACCGTAAA GTCGGGGTGT GGGGGACCAG GAGAGGGCTA 174 0 

CTGTGGTAAA TGGGGGTGTG AAACCACCGG ACAGGCTTAC TGGAAGCCCA CATCATCGTG 180 0 

GGACCTAATC TCCCTTAAGC GCGGTAACAC CCCCTGGGAC ACGGGATGCT CTAAAGTTGC 1860 

CTGTGGCCCC TGCTACGACC TCTCCAAAGT ATCCAATTCC TTCCAAGGGG CTACTCGAGG 1920 

GGGCAGATGC AACCCTCTAG TCCTAGAATT CACTGATGCA GGAAAAAAGG CTAACTGGGA 1980 

CGGGCCCAAA TCGTGGGGAC TGAGACTGTA CCGGACAGGA ACAGATCCTA TTACCATGTT 2 040 

CTCCCTGACC CGGCAGGTCC TTAATGTGGG ACCCCGAGTC CCCATAGGGC CCAACCCAGT 2100 

ATTACCCGAC CAAAGACTCC CTTCCTCACC AATAGAGATT GTACCGGCTC CACAGCCACC 2160 

TAGCCCCCTC AATACCAGTT ACCCCCCTTC CACTACCAGT ACACCCTCAA CCTCCCCTAC 2220 

AAGTCCAAGT GTCCCACAGC CACCCCCAGG AACTGGAGAT AGACTACTAG CTCTAGTCAA 2280 

AGGAGCCTAT CAGGCGCTTA ACCTCACCAA TCCCGACAAG ACCCAAGAAT GTTGGCTGTG 2340 

CTTAGTGTCG GGACCTCCTT ATTACGAAGG AGTAGCGGTC GTGGGCACTT ATACCAATCA 240O 

TTCCACCGCT CCGGCCAACT GTACGGCCAC TTCCCAACAT AAGCTTACCC TATCTGAAGT 2460 

GACAGGACAG GGCCTATGCA TGGGGGCAGT ACCTAAAACT CACCAGGCCT TATGTAACAC 2520 

CACCCAAAGC GCCGGCTCAG GATCCTACTA CCTTGCAGCA CCCGCCGGAA CAATGTGGGC 2580 

TTGCAGCACT GGATTGACTC CCTGCTTGTC CACCACGGTG CTCAATCTAA CCACAGATTA 2 640 

TTGTGTATTA GTTGAACTCT GGCCCAGAGT AATTTACCAC TCCCCCGATT ATATGTATGG 2700 

TCAGCTTGAA CAGCGTACCA AATATAAAAG AGAGCCAGTA TCATTGACCC TGGCCCTTCT 2760 

ACTAGGAGGA TTAACCATGG GAGGGATTGC AGCTGGAATA GGGACGGGGA CCACTGCCTT 282 0 

AATTAAAACC CAGCAGTTTG AGCAGCTTCA TGCCGCTATC CAGACAGACC TCAACGAAGT 2 880 

CGAAAAGTCA ATTACCAACC TAGAAAAGTC ACTGACCTCG TTGTCTGAAG TAGTCCTACA 2 94 0 

GAACCGCAGA GGCCTAGATT TGCTATTCCT AAAGGAGGGA GGTCTCTGCG CAGCCCTAAA 3 000 

AGAAGAATGT TGTTTTTATG CAGACCACAC GGGGCTAGTG AGAGACAGCA TGGCCAAATT 3060 

AAGAGAAAGG CTTAATCAGA GACAAAAACT ATTTGAGACA GGCCAAGGAT GGTTCGAAGG 3120 

GCTGTTTAAT AGATCCCCCT GGTTTACCAC CTTAATCTCC ACCATCATGG GACCTCTAAT 3180 

AGTACTCTTA CTGATCTTAC TCTTTGGACC TTGCATTCTC AATCGATTAG TTCAATTTGT 3240 

TAAAGACAGG ATCTCAGTAG TCCAGGCTTT AGTCCTGACT CAACAATACC ACCAGCTAAA 3300 

GCCTATAGAG TACGAGCCAT AGGGCGCCTA GTGTTGACAA TTAATCATCG GCATAGTATA 3 3 60 

CGGCATAGTA TAATACGACT CACTATAGGA GGGCCACCAT GGCCAAGTTG ACCAGTGCCG 3 42 0 

TTCCGGTGCT CACCGCGCGC GACGTCGCCG GAGCGGTCGA GTTCTGGACC GACCGGCTCG 3 480 

GGTTCTCCCG GGACTTCGTG GAGGACGACT TCGCCGGTGT GGTCCGGGAC GACGTGACCC 3 54 0 

TGTTCATCAG CGCGGTCCAG GACCAGGTGG TGCCGGACAA CACCCTGGCC TGGGTGTGGG 3600 

TGCGCGGCCT GGACGAGCTG TACGCCGAGT GGTCGGAGGT CGTGTCCACG AACTTCCGGG 3660 

ACGCCTCCGG GCCGGCCATG ACCGAGATCG GCGAGCAGCC GTGGGGGCGG GAGTTCGCCC 3720 

TGCGCGACCC GGCCGGCAAC TGCGTGCACT TCGTGGCCGA GGAGCAGGAC TGANNKNCGG 3780 

ACCGGTCGAC TTGTTAACTT GTTTATTGCA GCTTATAATG GTTACAAATA AAGCAATAGC 3840 

ATCACAAATT TCACAAATAA AGCATTTTTT TCACTGCATT CTAGTTGTGG TTTGTCCAAA 3900 

CTCATCAATG TATCTTATCA TGTCTGGATC CAGATCTGGG CCCATGCGGC CGCGGATCGA 3960 

TNNNNACATG TGAGCAAAAG GCCAGCAAAA GGCCAGGAAC CGTAAAAAGG CCGCGTTGCT 4020 

GGCGTTTTTC CATAGGCTCC GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA 4080 
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GAGGTGGCGA 

CGTGCGCTCT 

GGGAAGCGTG 

TCGCTCCAAG 

CGGTAACTAT 

CACTGGTAAC 

GTGGCCTAAC 

AGTTACCTTC 

CGGTGGTTTT 

TCCTTTGATC 

TTTGGTCATG 

TTTTAAATCA 

CAGTGAGGCA 

CGTCGTGTAG 

ACCGCGAGAC 

GGCCGAGCGC 

CCGGGAAGCT 

TACAGGCATC 

ACGATCAAGG 

TCCTCCGATC 

ACTGCATAAT 

CTCAACCAAG 

AATACGGGAT 

TTCTTCGGGG 

CACTCGTGCA 

AAAAACAGGA 

ACTCATACTC 

CGGATACATA 

CCGAAAAGTG 

TAGGCGTATC 

ACACATGCAG 

AGCCCGTCAG 

ATCAGAGCAG 



AACCCGACAG 

CCTGTTCCGA 

GCGCTTTCTC 

CTGGGCTGTG 

CGTCTTGAGT 

AGGATTAGCA 

TACGGCTACA 

GGAAAAAGAG 

TTTGTTTGCA 

TTTTCTACGG 

AGATTATCAA 

ATCTAAAGTA 

CCTATCTCAG 

ATAACTACGA 

CCACGCTCAC 

AGAAGTGGTC 

AGAGTAAGTA 

GTGGTGTCAC 

CGAGTTACAT 

GTTGTCAGAA 

TCTCTTACTG 

TCATTCTGAG 

AATACCGCGC 

CGAAAACTCT 

CCCAACTGAT 

AGGCAAAATG 

TTCCTTTTTC 

TTTGAATGTA 

CCACCTGACG 

ACGAGGCCCT 

CTCCCGGAGA 

GGCGCGTCAG 

ATTGTACTGA 



GACTATAAAG 

CCCTGCCGCT 

AATGCTCACG 

TGCACGAACC 

CCAACCCGGT 

GAGCGAGGTA 

CTAGAAGGAC 

TTGGTAGCTC 

AGCAGCAGAT 

GGTCTGACGC 

AAAGGATCTT 

TATATGAGTA 

CGATCTGTCT 

TACGGGAGGG 

CGGCTCCAGA 

CTGCAACTTT 

GTTCGCCAGT 

GCTCGTCGTT 

GATCCCCCAT 

GTAAGTTGGC 

TCATGCCATC 

AATAGTGTAT 

CACATAGCAG 

CAAGGATCTT 

CTTCAGCATC 

CCGCAAAAAA 

AATATTATTG 

TTTAGAAAAA 

TCTAAGAAAC 

TTCGTCTCGC 

CGGTCACAGC 

CGGGTGTTGG 

GAGTGCAC 
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2 

ATACCAGGCG 

TACCGGATAC 

CTGTAGGTAT 

CCCCGTTCAG 

AAGACACGAC 

TGTAGGCGGT 

AGTATTTGGT 

TTGATCCGGC 

TACGCGCAGA 

TCAGTGGAAC 

CACCTAGATC 

AACTTGGTCT 

ATTTCGTTCA 

CTTACCATCT 

TTTATCAGCA 

ATCCGCCTCC 

TAATAGTTTG 

TGGTATGGCT 

GTTGTGCAAA 

CGCAGTGTTA 

CGTAAGATGC 

GCGGCGACCG 

AACTTTAAAA 

ACCGCTGTTG 

TTTTACTTTC 

GGGAATAAGG 

AAGCATTTAT 

TAAACAAATA 

CATTATTATC 

GCGTTTCGGT 

TTGTCTGTAA 

CGGGTGTCGG 
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TTTCCCCCTG 
CTGTCCGCCT 
CTCAGTTCGG 
CCCGACCGCT 
TTATCGCCAC 
GCTACAGAGT 
ATCTGCGCTC 
AAACAAACCA 
AAAAAAGGAT 
GAAAACTCAC 
CTTTTAAATT 
GACAGTTACC 
TCCATAGTTG 
GGCCCCAGTG 
ATAAACCAGC 
ATCCAGTCTA 
CGCAACGTTG 
TCATTCAGCT 
AAAGCGGTTA 
TCACTCATGG 
TTTTCTGTGA 
AGTTGCTCTT 
GTGCTCATCA 
AGATCCAGTT 
ACCAGCGTTT 
GCGACACGGA 
CAGGGTTATT 
GGGGTTCCGC 
ATGACATTAA 
GATGACGGTG 
GCGGATGCCG 
GGCTGGCTTA 



GAAGCTCCCT 

TTCTCCCTTC 

TGTAGGTCGT 

GCGCCTTATC 

TGGCAGCAGC 

TCTTGAAGTG 

TGCTGAAGCC 

CCGCTGGTAG 

CTCAAGAAGA 

GTTAAGGGAT 

AAAAATGAAG 

AATGCTTAAT 

CCTGACTCCC 

CTGCAATGAT 

CAGCCGGAAG 

TTAATTGTTG 

TTGCCATTGC 

CCGGTTCCCA 

GCTCCTTCGG 

TTATGGCAGC 

CTGGTGAGTA 

GCCCGGCGTC 

TTGGAAAACG 

CGATGTAACC 

CTGGGTGAGC 

AATGTTGAAT 

GTCTCATGAG 

GCACATTTCC 

CCTATAAAAA 

AAAACCTCTG 

GGAGCAGACA 

ACTATGCGGC 
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Figure 10. FBdelPMOSAF Sequence 1 

CATATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA AATACCGCAT CAGGCGCCAT 60 
TCGCCATTCA GGCTGCGCAA CTGTTGGGAA GGGCGATCGG TGCGGGCCTC TTCGCTATTA ' 120 

CGCCAGCTGG CGAAAGGGGG ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAGGGTTT 180 

TCCCAGTCAC GACGTTGTAA AACGACGGCC AGTGAATTCC GATTAGTTCA ATTTGTTAAA 2 40 

GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC AATACCACCA GCTAAAACCA 300 

CTAGAATACG AGCCACAATA AATAAAAGAT TTTATTTAGT TTCCAGAAAA AGGGGGGAAT 3 60 

GAAAGACCCC ACCAAATTGC TTACCCTGAT AGCCGCAGTA ACGCCATTTT GCAAGGCATG 42 0 

GAAAAATACC AAACCAAGAA TAGAGAAGTT CAGATCAAGG GCGGGTACAC GAAAACAGCT 480 

AACGTTGGGC CAAACAGGAT ATCTGCGGTG AGCAGTTTCG GCCCCGGCCC GGGGCCAAGA 54 0 

ACAGATGGTC ACCGCGGTTC GGCCCCGGCC CGGGGCCAAG AACAGATGGT CCCCAGATAT^ 600 

GGCCCAACCC TCAGCAGTTT CTTAAGACGC ATCAGATGTT TCCAGGCTCC CCCAAGGACC 660 

TGAA-ATGACC CTGTGCCTTA TTTGAATTAA CCAATCAGCC TGCTTCTCGC TTCTGTTCGC 720 

GCGCTTCTGC TTCCCGAGCT CTATAAAAGA GCTCACAACC CCTCACTCGG CGCGCCAGTC 780 

CTCCGATAGA CTGAGTCGCC CGGGTACCCG TGTATCCAAT AAATCCTCTT GCTGTTGCAT 840 

CCGACTCGTG GTCTCGCTGT TCCTTGGGAG GGTCTCCTCA GAGTGATTGA CTACCCGTCT 900 

CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCTGGAGA CCCCTGCCCA GGGACCACCG 960 

ACCCACCACC GGGAGGTAAG CTGGCCAAGA TCTTATATGG GGCACCCCCG CCCCTTGTAA 1020 

ACTTCCCTGA CCCTGACATG ACAAGAGTTA CTAACAGCCC CTCTCTCCAA GCTCACTTAC 1080 

AGGCTCTCTA CTTAGTCCAG CACGAAGTCT GGAGACCTCT GGCGGCAGCC TACCAAGAAC 1140 

AACTGGACCG ACCGGTGGTA CCTCACCCTT ACCGAGTCGG CGACACAGTG TGGGTCCGCC 12 00 

GACACCAGA.C TAAGAACCTA GAACCTCGCT GGAAAGGACC TTACACAGTC CTGCTGACCA ^2 60 

CCCCCACCGC CCTCAAAGTA GACGGCATCG CAGCTTGGAT ACACGCCGCC CACGTGAAGG 1320 

CTGCCGACCC CGGGGGTGGA CCATCCTCTA GACTGACATG GCGCGTTCAA CGCTCTCAAA 13 80 

ACCCCTTAAA AATAAGGTTA ACCCGCGAGG CCCCCTAATC CCCTTAATTC TTCTGATGCT 1440 

CAGAGGGGTC AGTACTGCTT CGCCCGGCTC CAGTCCTCAT CAAGTCTATA ATATCACCTG 1500 

GGAGGTAACC AATGGAGATC GGGAGACGGT ATGGGCAACT TCTGGCAACC ACCCTCTGTG 15 60 

GACCTGGTGG CCTGACCTTA CCCCAGATTT ATGTATGTTA GCCC ACCATG GACCATCTTA 1620 

TTGGGGGCTA GAATATCAAT CCCCTTTTTC TTCTCCCCCG GGGCCCCCTT GTTGCTCAGG 16 80 

GGGCAGCAGC CCAGGCTGTT CCAGAGACTG CGAAGAACCT TTAACCTCCC TCACCCCTCG 1740 

GTGCAACACT GCCTGGAACA GACTCAAGCT AGACCAGACA ACTCATAAAT CAAATGAGGG 1800 

ATTTTATGTT TGCCCCGGGC CCCACCGCCC CCGAGAATCC AAGTCATGTG GGGGTCCAGA 1860 

CTCCTTCTAC TGTGCCTATT GGGGCTGTGA GACAACCGGT AGAGCTTACT GGAAGCCCTC 192 0 

CTCATCATGG GATTTCATCA CAGTAAACAA CAATCTCACC TCTGACCAGG CTGTCCAGGT 1980 

ATGCAAAGAT AATAAGTGGT GCAACCCCTT AGTTATTCGG TTTACAGACG CCGGGAGACG 2040 

GGTTACTTCC TGGACCACAG GACATTACTG GGGCTTACGT TTGTATGTCT CCGGACAAGA 2100 

TCCAGGGCTT ACATTTGGGA TCCGACTCAG ATACCAAAAT CTAGGACCCC GCGTCCCAAT 2160 

AGGGCCAAAC CCCGTTCTGG CAGACCAACA GCCACTCTCC AAGCCCAAAC CTGTTAAGTC 22 20 

GCCTTCAGTC ACCAAACCAC CCAGTGGGAC TCCTCTCTCC CCTACCCAAC TTCCACCGGC 22 80 

GGGAACGGAA AATAGGCTGC TAAACTTAGT AGACGGAGCC TACCAAGCCC TCAACCTCAC 23 40 

CAGTCCTGAC AAAACCCAAG AGTGCTGGTT GTGTCTAGTA GCGGGACCCC CCTACTACGA 24 00 

AGGGGTTGCC GTCCTGGGTA CCTACTCCAA CCATACCTCT GCTCCAGCCA ACTGCTCCGT 24 60 

GGCCTCCCAA CACAAGTTGA CCCTGTCCGA AGTGACCGGA CAGGGACTCT GCATAGGAGC 2520 

AGTTCCCAAA ACACATCAGG CCCTATGTAA TACCACCCAG ACAAGCAGTC GAGGGTCCTA 2580 

TTATCTAGTT GCCCCTACAG GTACCATGTG GGCTTGTAGT ACCGGGCTTA CTCCATGCAT 2 640 

CTCCACCACC ATACTGAACC TTACCACTGA TTATTGTGTT CTTGTCGAAC TCTGGCCAAG 2700 

AGTCACCTAT CATTCCCCCA GCTATGTTTA CGGCCTGTTT GAGAGATCCA ACCGACACAA 27 60 

AAGAGAACCG GTGTCGTTAA CCCTGGCCCT ATTATTGGGT GGACTAACCA TGGGGGGAAT 2820 

TGCCGCTGGA ATAGGAACAG GGACTACTGC TCTAATGGCC ACTCAGCAAT TCCAGCAGCT 2880 

CCAAGCCGCA GTACAGGATG ATCTCAGGGA GGTTGAAAAA TCAATCTCTA ACCTAGAAAA 2940 

GTCTCTCACT TCCCTGTCTG AAGTTGTCCT ACAGAATCGA AGGGGCCTAG ACTTGTTATT 3000 

TCTAAAAGAA GGAGGGCTGT GTGCTGCTCT AAAAGAAGAA TGTTGCTTCT ATGCGGACCA 3060 

CACAGGACTA GTGAGAGACA GCATGGCCAA ATTGAGAGAG AGGCTTAATC AGAGACAGAA 3120 

ACTGTTTGAG TCAACTCAAG GATGGTTTGA GGGACTGTTT AACAGATCCC CTTGGTTTAC 3180 

CACCTTGATA TCTACCATTA TGGGACCCCT CATTGTACTC CTAATGATTT TGCTCTTCGG 3240 

ACCCTGCATT CTTAATCGAT TAGTTCAATT TGTTAAAGAC AGGATCTCAG TAGTCCAGGC 3300 

TTTAGTCCTG ACTCAACAAT ACCACCAGCT AAAGCCTATA GAGTACGAGC CATAGGGCGC 33 60 

CTAGTGTTGA CAATTAATCA TCGGCATAGT ATACGGCATA GTATAATACG ACTCACTATA 3420 

GGAGGGCCAC CATGGCCAAG TTGACCAGTG CCGTTCCGGT GCTCACCGCG CGCGACGTCG 3480 

CCGGAGCGGT CGAGTTCTGG ACCGACCGGC TCGGGTTCTC CCGGGACTTC GTGGAGGACG 3 540 

ACTTCGCCGG TGTGGTCCGG GACGACGTGA CCCTGTTCAT CAGCGCGGTC CAGGACCAGG 3 6 00 
TGGTGCCGGA CAACACCCTG GCCTGGGTGT GGGTGCGCGG CCTGGACGAG CTGTACGCCG - 3 660 

AGTGGTCGGA GGTCGTGTCC ACGAACTTCC GGGACGCCTC CGGGCCGGCC ATGACCGAGA 372 0 

TCGGCGAGCA GCCGTGGGGG CGGGAGTTCG CCCTGCGCGA CCCGGCCGGC AACTGCGTGC 3 780 

ACTTCGTGGC CGAGGAGCAG GACTGANNNN CGGACCGGTC GACTTGTTAA CTTGTTTATT 3 840 

GCAGCTTATA ATGGTTACAA ATAAAGCAAT AGCATCACAA ATTTCACAAA TAAAGCATTT 3 900 

TTTTCACTGC ATTCTAGTTG TGGTTTGTCC AAACTCATCA ATGTATGTTA TCATGTCTGG 3 960 

ATCCAGATCT GGGCCCATGC GGCCGCGGAT CGATNNNNAC ATGTGAGCAA AAGGCCAGCA 402 0 

AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC 4080 
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TGACGAGCAT 

AAGATACCAG 

GCTTACCGGA 

ACGCTGTAGG 

ACCCCCCGTT 

GGTAAGACAC 

GTATGTAGGC 

GACAGTATTT 

CTCTTGATCC 

GATTACGCGC 

CGCTCAGTGG 

CTTCACCTAG 

GTAAACTTGG 

TCTATTTCGT 

GGGCTTACCA 

AGATTTATCA 

TTTATCCGCC 

AGTTAATAGT 

GTTTGGTATG 

CATGTTGTGC 

GGGCGCAGTG 

ATCCGTAAGA 

TATGCGGCGA 

CAGAACTTTA 

CTTACCGCTG 

ATCTTTTACT 

AAAGGGAATA 

TTGAAGCATT 

AAATAAACAA 

AACCATTATT 

CGCGCGTTTC 

AGCTTGTCTG 

TGGCGGGTGT 

C 



CACAAAAATC 

GCGTTTCCCC 

TACCTGTCCG 

TATCTCAGTT 

CAGCCCGACC 

GACTTATCGC 

GGTGCTACAG 

GGTATCTGCG 

GGCAAACAAA 

AGAAAAAAAG 

AACGAAAACT 

ATCCTTTTAA 

TCTGACAGTT 

TCATCCATAG 

TCTGGCCCCA 

GCAATAAACC 

TCCATCCAGT 

TTGCGCAACG 

GCTTCATTCA 

AAAAAAGCGG 

TTATCACTCA 

TGCTTTTCTG 

CCGAGTTGCT 

AAAGTGCTCA 

TTGAGATCCA 

TTCACCAGCG 

AGGGCGACAC 

TATCAGGGTT 

ATAGGGGTTC 

ATCATGACAT 

GGTGATGACG 

TAAGCGGATG 

CGGGGCTGGC 



GACGCTCAAG 

CTGGAAGCTC 

CCTTTCTCCC 

CGGTGTAGGT 

GCTGCGCCTT 

CACTGGCAGC 

AGTTCTTGAA 

CTCTGCTGAA 

CCACCGCTGG 

GATCTCAAGA 

CACGTTAAGG 

ATTAAAAATG 

ACCAATGCTT 

TTGCCTGACT 

GTGCTGCAAT 

AGCCAGCCGG 

CTATTAATTG 

TTGTTGCCAT 

GCTCCGGTTC 

TTAGCTCCTT 

TGGTTATGGC 

TGACTGGTGA 

CTTGCCCGGC 

TCATTGGAAA 

GTTCGATGTA 

TTTCTGGGTG 

GGAAATGTTG 

ATTGTCTCAT 

CGCGCACATT 

TAACCTATAA 

GTGAAAACCT 

CCGGGAGCAG 

TTAACTATGC 



CAGAGGTGG 
CCTCGTGCGC 
TTCGGGAAGC 
CGTTCGCTCC 
ATCCGGTAAC 
AGCCACTGGT 
GTGGTGGCCT 
GCCAGTTACC 
TAGCGGTGGT 
AGATCCTTTG 
GATTTTGGTC 
AAGTTTTAAA 
AATCAGTGAG 
CCCCGTCGTG 
GATACCGCGA 
AAGGGCCGAG 
TTGCCGGGAA 
TGCTACAGGC 
CCAACGATCA 
CGGTCCTCCG 
AGCACTGCAT 
GTACTCAACC 
GTCAATACGG 
ACGTTCTTCG 
AGCCACTGGT 
AGCAAAAACA 
AATACTCATA 
GAGCGGATAC 
TCCCCGAAAA 
AAATAGGCGT 
CTGACACATG 
ACAAGCCCGT 
GGCATCAGAG 



CGAAACCCGA 
TCTCCTGTTC 
GTGGCGCTTT 
AAGCTGGGCT 
TATCGTCTTG 
AACAGGATTA 
AACTACGGCT 
TTCGGAAAAA 
TTTTTTGTTT 
ATCTTTTCTA 
ATGAGATTAT 
TCAATCTAAA 
GCACCTATCT 
TAGATAACTA 
GACCCACGCT 
CGCAGAAGTG 
GCTAGAGTAA 
ATCGTGGTGT 
AGGCGAGTTA 
ATCGTTGTCA 
AATTCTCTTA 
AAGTCATTCT 
GATAATACCG 
GGGCGAAAAC 
GCACCCAACT 
GGAAGGCAAA 
CTCTTCCTTT 
ATATTTGAAT 
GTGCCACCTG 
ATCACGAGGC 
CAGCTCCCGG 
CAGGGCGCGT 
CAGATTGTAC 



CAGGACTATA 

CGACCCTGCC 

CTCAATGCTC 

GTGTGCACGA 

AGTCCAACCC 

GCAGAGCGAG 

ACACTAGAAG 

GAGTTGGTAG 

GCAAGCAGCA 

CGGGGTCTGA 

CAAAAAGGAT 

GTATATATGA 

CAGCGATCTG 

CGATACGGGA 

CACCGGCTCC 

GTCCTGCAAC 

GTAGTTCGCC 

CACGCTCGTC 

CATGATCCCC 

GAAGTAAGTT 

CTGTCATGCC 

GAGAATAGTG 

CGCCACATAG 

TCTCAAGGAT 

GATCTTCAGC 

ATGCCGCAAA 

TTCAATATTA 

GTATTTAGAA 

ACGTCTAAGA 

CCTTTCGTCT 

AGACGGTCAC 

CAGCGGGTGT 

TGAGAGTGCA 
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CATATGCGGT 

TCGCCATTCA 

CGCCAGCTGG 

TCCCAGTCAC 

GACAGGATCT 

CTAGAATACG 

GAAAGACCCC 

GAAAAATACC 

AACGTTGGGC 

ACAGATGGTC 

GGCCCAACCC 

TGAAATGACC 

GCGCTTCTGC 

CTCCGATAGA 

CCGACTCGTG 

CGGGGGTCTT 

ACCCACCACC 

CGGCCTTTGT 

TACATTGTGC 

AAGAGACCTT 

CCTTAGCGCT 

TTCTCTATGG 

GATTTCTCCC 

GGGACCAGAT 

TCGGGGATCA 

GCCCATACCT 

GGGTCCATGC 

TGGAAAAGAC 

AATAAGAACC 

GTTGTCTGGG 

GATGTATGTG 

TCCTCTAAAC 

TGGGGAGCCA 

TACGTATGTC 

TCCCTATACT 

TCCTCAAAAG 

CAACAGTGTC 

AAATTATCCA 

CATCCAGGCG 

GGTCCTGACC 

CCTCTTCCCC 

GCGACTAGTG 

CCCACCACAG 

ACCAACCCAG 

GAAGCAATAG 

GGGACCCAAG 

GTGCCCTTTA 

CATCAGTATC 

TGCCTCTCCA 

CCTCGCATCT 

AGGACTAAAA 

GCGGGAATAG 

CTGACAAGCC 

AAGTTAGAGG 

GACTTGCTGT 

TACATAGACC 

AAAAGACAGT 

CCTTGGTTCA 

CTCATCCTCG 

GTAGTCCAGG 

CCATAGGGCG 

GACTCACTAT 

GCGCGACGTC 

CGTGGAGGAC 

CCAGGACCAG 

GCTGTACGCC 

CATGACCGAG 

CAACTGCGTG 



GTGAAATACC 

GGCTGCGCAA 

CGAAAGGGGG 

GACGTTGTAA 

CAGTAGTCCA 

AGCCACAATA 

ACCAAATTGC 

AAACCAAGAA 

CAAACAGGAT 

ACCGCGGTTC 

TCAGCAGTTT 

CTGTGCCTTA 

TTCCCGAGCT 

CTGAGTCGCC 

GTCTCGCTGT 

TCATTTGGGG 

GGGAGGTAAG 

TGCTCAGGTA 

GTATAGACCC 

GACCAAATTA 

GCTTAGGGCC 

AGGACCACCC 

TGTCTTATTT 

CAAAGAGGTG 

AGTGCTTGTC 

GGTGTTGCTG 

TTCTCACCTC 

TGATCATCCT 

CCCACCAGCC 

ATACAAAGGC 

CCTTGGCGGC 

GAGTCAGACC 

TAGGGTGCAG 

CCCGGGATGG 

GTAAAGAATG 

ACCTCATAAC 

ACCAGACCGG 

AGGACTGGAT 

TACAGTTCAC 

TCGTCCTTGT 

CAAGGGAAGC 

CACAAACTCC 

GCGACAGACT 

GGGCCACTGA 

CCTCATCAGG 

GAAAGCTCAC 

CCCATCAGCA 

TGCTCCCCTC 

CCTCAGTTTT 

ATTACTATCC 

GAGAGGCTGT 

GTACTGGTTC 

TCCAGATCGC 

ACTCACTGAC 

TTCTAAAAGA 

ACTCAGGTGC 

TAGAGCGCCA 

CTACCCTGCT 

GGCCATGCAT 

CTTTAGTCCT 

CCTAGTGTTG 

AGGAGGGCCA 

GCCGGAGCGG 

GACTTCGCCG 

GTGGTGCCGG 

GAGTGGTCGG 

ATCGGCGAGC 

CACTTCGTGG 



GCACAGATGC 

CTGTTGGGAA 

ATGTGCTGCA 

AACGACGGCC 

GGCTTTAGTC 

AATAAAAGAT 

TTAGCCTGAT 

TAGAGAAGTT 

ATCTGCGGTG 

GGCCCCGGCC 

CTTAAGACCC 

TTTGAATTAA 

CTATAAAAGA 

CGGGTACCCG 

TCCTTGGGAG 

GCTCGTCCGG 

CTGGCCAAGA 

AGTCAGGGAC 

CAGAGCTCAG 

GCCTTAGAGA 

AC-GAATACCC 

CCCATACTTG 

ACTCACTTAA 

TATAAGCCTG 

AGACGCCATC 

ACTACCCCGA 

AAACCTGCAC 

CTTAAGCTGC 

CATGACCCTC 

AGTCCAGCCC 

TAGTCTTGAG 

TCCGGACTCA 

CTACCCTCGG 

CCGGACCCTT 

GGATTGTGAG 

TGTAAAATGG 

CTGGTGTAAC 

AACGGGAAAA 

CATTCGCTTA 

GGAACAAGGA 

GCCACCGCCA 

CACGGTGAGA 

TTTTGATCTT 

GTCTTGCTGG 

AGAGGTCGCC 

CCTCACTGAG 

TCTCTGCAAT 

CAACCATAGC 

TAATCAGACT 

TGAAGAAGTT 

CTCACTTACC 

AACTGCCTTA 

CATAGATGCT 

TTCCCTGTCC 

AGGTGGCCTC 

AGTACGGGAC 

GAAAAGCCAA 

ATCAACCATC 

CATCAATCGA 

GACTCAACAA 

ACAATTAATC 

CCATGGCCAA 

TCGAGTTCTG 

GTGTGGTCCG 

ACAACACCCT 

AGGTCGTGTC 

AGCCGTGGGG 

CCGAGGAGCA 
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1 

GTAAGGAGAA 

GGGCGATCGG 

AGGCGATTAA 

AGTGAATTCC 

CTGACTCAAC 

TTTATTTAGT 

AGCCGCAGTA 

CAGATCAAGG 

AGCAGTTTCG 

CGGGGCCAAG 

ATCAGATGTT 

CCAATCAGCC 

GCTCACAACC 

TGTATCCAAT 

GGTCTCCTCA 

GATCTGGAGA 

TCCCTAAGGT 

TGGCCACTCA 

GTCAGGTAGA 

CCGGTGGAAA 

CTGGCCGGTT 

AGTCTGGAGA 

AGGCTTTAGA 

GTACCGTAAC 

GACCCAGCAG 

CCGCGGTAAA 

CACCTTCGGC 

GTATTCGGCG 

ACTTGGCAGG 

CCTTGGACTT 

TCCTGGGATA 

GACTATACTG 

GCTAGGACTA 

TCAGAAGCTA 

ACCACGGGGA 

GACCAAAATA 

CCCCTTAAAA 

ACCTGGGGAT 

AAAATCACCA 

CCTCCTAGAA 

TCTCTCCCCG 

AAAACAATTG 

GTGCAGGGGG 

CTTTGTTTGG 

TACTCCACCG 

GTCTCAGGAC 

CAGACCCTAT 

TGGTGGGCTT 

AGAGATTTCT 

TTGTTACAGG 

CTAGCTGTTT 

ATTAAAGGAC 

GACCTCCGGG 

GAGGTAGTGC 

TGTGCGGCCC 

TCCATGAAAA 

AACTGGTATG 

GCTGGGCCCC 

TTAGTTCAAT 

TACCACCAGC 

ATCGGCATAG 

GTTGACCAGT 

GACCGACCGG 

GGACGACGTG 

GGCCTGGGTG 

CACGAACTTC 

GCGGGAGTTC 

GGACTGANNN 
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AATACCGCAT 
TGCGGGCCTC 
GTTGGGTAAC 
GATTAGTTCA 
AATACCACCA 
TTCCAGAAAA 
ACGCCATTTT 
GCGGGTACAC 
GCCCCGGCCC 
AACAGATGGT 
TCCAGGCTCC 
TGCTTCTCGC 
CCTCACTCGG 
AAATCCTCTT 
GAGTGATTGA 
CCCCTGCCCA 
ACTCGGGTCA 
ACTGGGGATA 
AAGAATGAAC 
AGACTGGGTG 
TGGTTTAACT 
AACTTTGGGT 
AATTGTAAGG 
AATCCCTCAC 
CCTTGAGCCT 
AGTCGATGGT 
ACCAGATGAG 
GCGGCGGGAC 
TACTGTCCCA 
GGTGGCCCAC 
TCCCGGGAAC 
CCGCTTATAA 
GAATGGCAAG 
GAAGGTGCGG 
CCGGTTATTG 
GCGAATGGAC 
TAGATTTCAC 
TAAGATTCTA 
ACATGCCAGC 
CGTCCCTCGC 
ACTCTAACTC 
TTACCCTAAA 
CCTTCCTAAC 
CCATGGGCCC 
ACCTTGACCG 
ACGGGTTGTG 
CCATCAATTC 
GCAGCACTGG 
GTATCCAGGT 
CCTATGACAA 
TACTGGGGTT 
CTATAGACCT 
CCCTCCAAGA 
TCCAAAATAG 
TAAAGGAAGA 
AACTCAAAGA 
AAGGATGGTT 
TATTACTCCT 
TTGTTAAAGA 
TAAAGCCTAT 
TATACGGCAT 
GCCGTTCCGG 
CTCGGGTTCT 
ACCCTGTTCA 
TGGGTGCGCG 
CGGGACGCCT 
GCCCTGCGCG 
NCGGACCGGT 



CAGGCGCCAT 

TTCGCTATTA 

GCCAGGGTTT 

ATTTGTTAAA 

GCTAAAACCA 

AGGGGGGAAT 

GCAAGGCATG 

GAAAACAGCT 

GGGGCCAAGA 

CCCCAGATAT 

CCCAAGGACC 

TTCTGTTCGC 

CGCGCCAGTC 

GCTGTTGCAT 

CTACCCGTCT 

GGGACCACCG 

GACAATGGCC 

AATTGGAAGT 

AGAACAATTA 

ACCCTCCTTC 

CCTTATGAAA 

CCCGATGATA 

ACCCAAATCT 

CCGTTCCAGG 

CGGTGGAAAG 

ATTGCTGCCT 

TCCTGGGAGC 

GAGTCTGCAA 

AACTGGAGAC 

ACTTAAACCT 

CGATGTCTCG 

GCAAATCACC 

CTCTACCTTC 

GGGGCTAGAA 

GCTATCTAAA 

TCAAAAATTT 

AGACAAAGGA 

TGTGTCTGGA 

TGTGGCAGTA 

TCTCCCACCT 

CACAGCCCTG 

CACTCCGCCT 

CTTAAATGCT 

CCCTTATTAT 

GTGCCGCTGG 

CATAGGAAAG 

CTCCGGAGAC 

CCTCACCCCT 

CCAGCTGATT 

TTCTCACCCC 

GGGAATCACG 

CCAGCAAGGC 

CTCAGTCAGC 

GAGAGGCCTT 

GTGCTGTTTT 

AAAACTGGAT 

CAATAACTCC 

CCTTCTGTTG 

CAGGATCTCA 

AGAGTACGAG 

AGTATAATAC 

TGCTCACCGC 

CCCGGGACTT 

TCAGCGCGGT 

GCCTGGACGA 

CCGGGCCGGC 

ACCCGGCCGG 

CGACTTGTTA 
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4140 
4200 
4260 
4320 
4380 



p^CTTGriTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA 

ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATG^ATC^ " 

GATCCAGATC TGGGCCCATG CGGCCGCGGA TCGATN^JITOA CATG^GAGC^ 

AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT TGCTGGCGTT '^TTCCATArr 

CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG 

ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT 4 44 0 

CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCT- 4500 

I'^l^^^^^^Z ^^S^S^SZ^S ^^{'ZSIS^?! TCGTTCGCTC CAAGCTGGGC 4560 

^ ^^^^ 

4680 
4740 
4800 



TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT 
GAGTCCAACC CGGTAA.GACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGA-r-T 
AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTz^CGGC 
TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA 
AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTn' as fin 

TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT Iq^n 
ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA tUn 
TCAAAAAGGA TCTTCACCTA GATGCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA 
AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC 
TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT 
ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC 
TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCJ^GAAGT 
CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA 
^^^^^^ CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 
CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT 
ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC S=;9n 
t^^^J^^^T TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT 5580 
ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC llln 
n^.nn^l^^'^ GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCaItaCG C^T?S.tIIc llOO 
^S^S^^^^TA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA 
CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC 5820 
If^^'zjl'z^^ CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA lUo 
AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT ^QAn 
I^^^t'^'^ ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTCAA loOO 
TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAi^ AGTGCCACCT 6060 
GACGTCTAAG AAACCATTAT TATCATGACA TTAACCTATA AAAATAGGCG TATCACGAGG 612 0 

CCCTTTCGTC TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GcIgCTcJ?G 6180 

cagcttgtct gtaagcggat gccgggagca gaca^gcccg tcagggcgcg 62!° 
SSaSg?S? JJ<=^c<==°tg tcggggctgg cttaactatg cggcatcaga gcagattgta "00 
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CATATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA AATACCGCAT CAGGCGCCAT 
TCGCCATTCA GGCTGCGCAA CTGTTGGGAA GGGCGATCGG TGCGGGCCTC TTCGCTATTA 
CGCCAGCTGG CGAAAGGGGG ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAGGGTTT 
TCCCAGTCAC GACGTTGTAA AACGACGGCC AGTGAATTCC GATTAGTTCA ATTTGTTAAA 
GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC AATACCACCA GCTAAAACCA 
CTAGAATACG AGCCACAATA AATAAAAGAT TTTATTTAGT TTCCAGAAAA AGGGGGGAAT 
GAAAGACCCC ACCAAATTGC TTAGCCTGAT AGCCGCAGTA ACGCCATTTT GCAAGGCATG 
GAAAAATACC AAACCAAGAA TAGAGAAGTT CAGATCAAGG GCGGGTACAC GAAAACAGCT 
AACGTTGGGC CAAACAGGAT ATCTGCGGTG AGCAGTTTCG GCCCCGGCCC GGGGCCAAGA 
ACAGATGGTC ACCGCGGTTC GGCCCCGGCC CGGGGCCAAG AACAGATGGT CCCCAGATAT 
GGCCCAACCC TCAGCAGTTT CTTAAGACCC ATCAGATGTT TCCAGGCTCC CCCAAGGACC 
TGAAATGACC CTGTGCCTTA TTTGAATTAA CCAATCAGCC TGCTTCTCGC TTCTGTTCGC 
GCGCTTCTGC TTCCCGAGCT CTATAAAAGA GCTCACAACC CCTCACTCGG CGCGCCAGTC 
CTCCGATAGA CTGAGTCGCC CGGGTACCCG TGTATCCAAT AAATCCTCTT GCTGTTGCAT 
CCGACTCGTG GTCTCGCTGT TCCTTGGGAG GGTCTCCTCA GAGTGATTGA CTACCCGTCT 
CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCTGGAGA CCCCTGCCCA GGGACCACCG 
ACCCACCACC GGGAGGTAAG CTGGCCAAGA TCCCCCGGGC TGCAGGAATT TATGAAATCC 
TTTATGGGGG ACCCCCCCCT TTGTCAACCT TGCTCAATTC CTTCTCCCCC TCCGATCCTA 
AGACTGATTT ACAAGCCCGA CTAAAAGGGC TGCAAGGCGT GCAGGCCCAA ATCTGGACAC 
CCCTGGCCGA ATTGTACCGG CCAGGACATC CACAAACTAG CCACCCATTT CAGGTGGGAG 
A.CTCCGTGTA CGTCCGGCGG CACCGCTCTC AAGGA.TTGGA GCCTCGTTGG AAGGGACCTT 
ACATCGTCCT GCTGACCACG CCCACCGCCA TAAAGGTTGA CGGGATCGCC GCCTGGATTC 
ACGCATCGCA CGCCAAGGCA GCCCCAAAAA CCCCTGGACC AGAAACTCCC AAAACCTGGA 
AGCTCCGCCG TTCGGAGAAC CCTCTTAAGA TAAGACTCTC CCGTGTCTGA CTGCTAATCC 
ACCTTGTCCC TGTACTAACC CAAAATGAAA CTCCCAACAG GAATGGTCAT TTTATGTAGC 
CTAATAATAG TTCGGGCAGG GTTTGACGAC CCCCGCAAGG CTATCGCATT AGTACAAAAA 
CAACATGGTA AACCATGCGA ATGCAGCGGA GGGCAGGTAT CCGAGGCCCC ACCGAACTCC 
ATCCAACAGG TAACTTGCCC AGGCAAGACG GCCTACTTAA TGACCAACCA AAAATGGAAA 
TGCAGAGTCA CTCCAAAAAT CTCACCTAGC GGGGGAGAAC TCCAGAACTG CCCCTGTAAC 
ACTTTCCAGG ACTCGATGCA CAGTTCTTGT TATACTGAAT ACCGGCAATG CAGGCGAATT 
AATAAGACAT ACTACACGGC CACCTTGCTT AAAATACGGT CTGGGAGCCT CAACGAGGTA 
CAGATATTAC AAAACCCCAA TCAGCTCCTA CAGTCCCCTT GTAGGGGCTC TATAAATCAG 
CCCGTTTGCT GGAGTGCCAC AGCCCCCATC CATATCTCCG ATGGTGGAGG ACCCCTCGAT 
ACTAAGAGAG TGTGGACAGT CCAAAAAAGG CTAGAACAAA TTCATAAGGC TATGACTCCT 
GAACTTCAAT ACCACCCCTT AGCCCTGCCC AAAGTCAGAG ATGACCTTAG CCTTGATGCA 
CGGACTTTTG ATATCCTGAA TACCACTTTT AGGTTACTCC AGATGTCCAA TTTTAGCCTT 
GCCCAAGATT GTTGGCTCTG TTTAAAACTA GGTACCCCTA CCCCTCTTGC GATACCCACT 
CCCTCTTTAA CCTACTCCCT AGCAGACTCC CTAGCGAATG CCTCCTGTCA GATTATACCT 
CCCCTCTTGG TTCAACCGAT GCAGTTCTCC AACTCGTCCT GTTTATCTTC CCCTTTCATT 
AACGATACGG AACAAATAGA CTTAGGTGCA GTCACCTTTA CTAACTGCAC CTCTGTAGCC 
AATGTCAGTA GTCCTTTATG TGCCCTAAAC GGGTCAGTCT TCCTCTGTGG AAATAACATG 
GCATACACCT ATTTACCCCA AAACTGGACC AGACTTTGCG TCCAAGCCTC CCTCCTCCCC 
GACATTGACA TCAACCCGGG GGATGAGCCA GTCCCCATTC CTGCCATTGA TCATTATATA 
CATAGACCTA AACGAGCTGT ACAGTTCATC CCTTTACTAG CTGGACTGGG AATCACCGCA 
GCATTCACCA CCGGAGCTAC AGGCCTAGGT GTCTCCGTCA CCCAGTATAC AAAATTATCC 
CATCAGTTAA TATCTGATGT CCAAGTCTTA TCCGGTACCA TACAAGATTT ACAAGACCAG 
GTAGACTCGT TAGCTGAAGT AGTTCTCCAA AATAGGAGGG GACTGGACCT ACTAACGGCA 
GAACAAGGAG GAATTTGTTT AGCCTTACAA GAAAAATGCT GTTTTTATGC TAACAAGTCA 
GGAATTGTGA GAAACAAAAT AAGAACCCTA CAAGAAGAAT TACAAAAACG CAGGGAAAGC 
CTGGCAACCA ACCCTCTCTG GACCGGGCTG CAGGGCTTTC TTCCGTACCT CCTACCTCTC 
CTGGGACCCC TACTCACCCT CCTACTCATA CTAACCATTG GGCCATGCGT TTTCAGTCGC 
CTCATGGCCT TCATTAATGA TAGACTTAAT GTTGTACATG CCATGGTGCT GGCCCAGCAA 
TACCAAGCAC TCAAAGCTGA GGAAGAAGCT CAGGATTGAG GCGCCTAGTG TTGACAATTA 
ATCATCGGCA TAGTATACGG CATAGTATAA TACGACTCAC TATAGGAGGG CCACCATGGC 
CAAGTTGACC AGTGCCGTTC CGGTGCTCAC CGCGCGCGAC GTCGCCGGA.G CGGTCGAGTT 
CTGGACCGAC CGGCTCGGGT TCTCCCGGGA CTTCGTGGAG GACGACTTCG CCGGTGTGGT 
CCGGGACGAC GTGACCCTGT TCATCAGCGC GGTCCAGGAC CAGGTGGTGC CGGACAACAC 
CCTGGCCTGG GTGTGGGTGC GCGGCCTGGA CGAGCTGTAC GCCGAGTGGT CGGAGGTCGT 
GTCCACGAAC TTCCGGGACG CCTCCGGGCC GGCCATGACC GAGATCGGCG AGCAGCCGTG 
GGGGCGGGAG TTCGCCCTGC GCGACCCGGC CGGCAACTGC GTGCACTTCG TGGCCGAGGA 
GCAGGACTGA 1»3NNNCGGACC GGTCGACTTG TTAACTTGTT TATTGCAGCT TATAATGGTT 
ACAAATAAAG CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA 
GTTGTGGTTT GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCCAG ATCTGGGCCC 
ATGCGGCCGC GGATCGATNN NNACATGTGA GCAAAAGGCC AGCAAAAGGC CAGGAACCGT 
AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA GCATCACAAA 
AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA CCAGGCGTTT 
CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC CGGATACCTG 
TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG TAGGTATCTC 
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AGTTCGGTGT 

GACCGCTGCG 

TCGCCACTGG 

ACAGAGTTCT 

TGCGCTCTGC 

CAAACCACCG 

AAAGGATCTC 

AACTCACGTT 

TTAAATTAAA 

AGTTACCAAT 

ATAGTTGCCT 

CCCAGTGCTG 

AACCAGCCAG 

CAGTCTATTA 

AACGTTGTTG 

TTCAGCTCCG 

GCGGTTAGCT 

CTCATGGTTA 

TCTGTGACTG 

TGCTCTTGCC 

CTCATCATTG 

TCCAGTTCGA 

AGCGTTTCTG 

ACACGGAAAT 
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ACATTAACCT 

GACGGTGAAA 
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TGGCTTAACT 



AGGTCGTTCG 

CCTTATCCGG 

CAGCAGCCAC 

TGAAGTGGTG 

TGAAGCCAGT 

CTGGTAGCGG 

AAGAAGATCC 

AAGGGATTTT 
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GCTTAATCAG 

GACTCCCCGT 

CAATGATACC 
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ATTGTTGCCG 

CCATTGCTAC 

GTTCCCAACG 

CCTTCGGTCC 

TGGCAGCACT 

GTGAGTACTC 
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GAAAACGTTC 

TGTAACCCAC 

GGTGAGCAAA 

GTTGAATACT 

TCATGAGCGG 

CATTTCCCCG 

ATAAAAATAG 

ACCTCTGACA 

GCAGACAAGC 

ATGCGGCATC 



CTCCAAGCTG 

TAACTATCGT 

TGGTAACAGG 

GCCTAACTAC 

TACCTTCGGA 

TGGTTTTTTT 

TTTGATGTTT 

GGTCATGA.GA 

TAAATCAATC 

TGAGGCACCT 

CGTGTAGATA 

GCGAGACCCA 

CGAGCGCAGA 

GGAAGCTAGA 

AGGCATCGTG 

ATCAAGGCGA 

TCCGATGGTT 

GCATAATTCT 

AACCAAGTCA 

ACGGGATAAT 

TTCGGGGCGA 

TCGTGCACCC 

AACAGGAAGG 

CATACTCTTC 

ATACATATTT 

AAAAGTGCCA 

GCGTATCACG 

CATGCAGCTC 

CCGTCAGGGC 

AGAGCAGATT 
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GGCTGTGTGC 
CTTGAGTCCA 
ATTAGCAGAG 
GGCTACACTA 
AAAAGAGTTG 
GTTTGCAAGC 
TCTACGGGGT 
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CGCTCACCGG 
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GTAAGTAGTT 
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ACGAACCCCC 
ACCCGGTAAG 
CGAGGTATGT 
GAAGGACAGT 
GTAGCTCTTG 
AGCAGATTAC 
CTGACGCTCA 
GGATCTTCAC 
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TCTGTCTA.TT 
GGGAGGGCTT 
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CGCCAGTTAA 
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GTGTTGGCGG 
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Figure 13. hCMVlOAl Sequence I 

AGATCTCCCG ATCCCCTATG GTCGACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA 60 
AGCCAGTATC TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGCAAAATT ' 12 0 

TAAGCTACAA CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT TAGGGTTAGG 180 

CGTTTTGCGC TGCTTCGCGA TGTACGGGCC AGATATACGC GTTGACATTG ATTATTGACT 2 40 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 300 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 3 60 

ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 42 0 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 480 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 5^0 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 600 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 6 60 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 720 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 780 

CGGTGGGAGG TCTATATAAG CAGAGCTCTC TGGCTAACTA GAGAACCCAC TGCTTAACTG 840 

GCTTATCGAA ATGTCGACTG AGAACTTCAG GGTGAGTTTG GGGACCCTTG ATTGTTCTTT 900 

CTTTTTCGCT ATTGTAAAAT TCATGTTATA TGGAGGGGGC AAAGTTTTCA GGGTGTTGTT 960 

TAGAATGGGA AGATGTCCCT TGTATCACCA TGGACCCTCA TGATAATTTT GTTTCTTTCA 1020 

CTTTCTACTC TGTTGACAAC CATTGTCTCC TCTTATTTTC TTTTCATTTT CTGTAACTTT 1080 

TTCGTTAAAC TTTAGCTTGC ATTTGTAACG AATTTTTAAA TTCACTTTTG TTTATTTGTC 1140 

AGATTGTAJSlG TACTTTCTCT AATCACTTTT TTTTCAAGGC AATCAGGGTA TATTATATTG 12 00 

TACTTCAGCA CAGTTTTAGA GAACAATTGT TATAATTAAA TGATAAGGTA GAATATTTCT 12 60 

GCATATAAAT TCTGGCTGGC GTGGAAATAT TCTTATTGGT AGAAACAACT ACATCCTGGT 1320 

CATCATCCTG CCTTTCTCTT TATGGTTACA ATGATATACA CTGTTTGAGA TGAGGATAAA 13 80 

ATACTCTGAG TCCAAACCGG GCCCCTCTGC TAACCATGTT CATGCCTTCT TCTTTTTCCT 1440 

ACAGCTCCTG GGCAACGTGC TGGTTGTTGT GCTGTCTCAT CATTTTGGCA AGGATCGGCC 1500 

GGAACAGCAT CAGGACCGAC ATGGAAGGTC CAGCGTTCTC AAAACCCCTT AAAGATAAGA 15 60 

TTAACCCGTG GAAGTCCTTA ATGGTCATGG GGGTCTATTT AAGAGTAGGG ATGGCAGAGA 1620 

GCCCCCATCA GGTCTTTAAT GTAACCTGGA GAGTCACCAA CCTGATGACT GGGCGTACCG 1680 

CCAATGCCAC CTCCCTTTTA GGAACTGTAC AAGATGCCTT CCCAAGATTA TATTTTGATC 1740 

TATGTGATCT GGTCGGAGAA GAGTGGGACC CTTCAGACCA GGAACCATAT GTCGGGTATG 1800 

GCTGCAAATA CCCCGGAGGG AGAAAGCGGA CCCGGACTTT TGACTTTTAC GTGTGCCCTG 1860 

GGCATACCGT AAAATCGGGG TGTGGGGGGC CAAGAGAGGG CTACTGTGGT GAATGGGGTT 1920 

GTGAAACCAC CGGACAGGCT TACTGGAAGC CCACATCATC ATGGGACCTA ATCTCCCTTA 19 80 

AGCGCGGTAA CACCCCCTGG GACACGGGAT GCTCCAAAAT GGCTTGTGGC CCCTGCTACG 2040 

ACCTCTCCAA AGTATCCAAT TCCTTCCAAG GGGCTACTCG AGGGGGCAGA TGCAACCCTC 2100 

TAGTCCTAGA ATTCACTGAT GCAGGAAAAA AGGCTAATTG GGACGGGCCC AAATCGTGGG 2160 

GACTGAGACT GTACCGGACA GGAACAGATC CTATTACCAT GTTCTCCCTG ACCCGCCAGG 2220 

TCCTCAATAT AGGGCCCCGC ATCCCCATTG GGCCTAATCC CGTGATCACT GGTCAACTAC 22 80 

CCCCCTCCCG ACCCGTGCAG ATCAGGCTCC CCAGGCCTCC TCAGCCTCCT CCTACAGGCG 23 40 

CAGCCTCTAT AGTCCCTGAG ACTGCCCCAC CTTCTCAACA ACCTGGGACG GGAGACAGGC 2400 

TGCTAAACCT GGTAGAAGGA GCCTATCAGG CGCTTAACCT CACCAATCCC GACAAGACCC 2460 

AAGAATGTTG GCTGTGCTTA GTGTCGGGAC CTCCTTATTA CGAAGGAGTA GCGGTCGTGG 2520 
GCACTTATAC CAATCATTCT ACCGCCCCGG CCAGCTGTAC GGCCACTTCC CAACATAAGC . ' 2580. 

TTACCCTATC TGAAGTGACA GGACAGGGCC TATGCATGGG AGCACTACCT AAAACTCACC 2640 

AGGCCTTATG TAACACCACC CAAAGTGCCG GCTCAGGATC CTACTACCTT GCAGCACCCG 2700 

CTGGAACAAT GTGGGCTTGT AGCACTGGAT TGACTCCCTG CTTGTCCACC ACGATGCTCA 2760 

ATCTAACCAC AGACTATTGT GTATTAGTTG AGCTCTGGCC CAGAATAATT TACCACTCCC 2820 

CCGATTATAT GTATGGTCAG CTTGAACAGC GTACCAAATA TAAGAGGGAG CCAGTATCGT 2880 

TGACCCTGGC CCTTCTGCTA GGAGGATTAA CCATGGGAGG GATTGCAGCT GGAATAGGGA 2940 

CGGGGACCAC TGCCCTAATC AAAACCCAGC AGTTTGAGCA GCTTCACGCC GCTATCCAGA 3000 

CAGACCTCAA CGAAGTCGAA AAATCAATTA CCAACCTAGA AAAGTCACTG ACCTCGTTGT 3060 

CTGAAGTAGT CCTACAGAAC CGAAGAGGCC TAGATTTGCT CTTCCTAAAA GAGGGAGGTC 3120 

TCTGCGCAGC CCTAAAAGAA GAATGTTGTT TTTATGCAGA CCACACGGGA CTAGTGAGAG 3180 

ACAGCATGGC CAAACTAAGG GAAAGGCTTA ATCAGAGACA AAAACTATTT GAGTCAGGCC 3240 

AAGGTTGGTT CGAAGGGCAG TTTAATAGAT CCCCCTGGTT TACCACCTTA ATCTCCACCA 3300 

TCATGGGACC TCTAATAGTA CTCTTACTGA TCTTACTCTT TGGACCCTGC ATTCTCAATC 33 60 

GATTAGTTCA ATTTGTTAAA GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC 3420 

AATACCACCA GCTAAAGCCT ATAGAGTACG AGCCATAGGG CGCCTAGTGT TGACAATTAA 3 4 80 

TCATCGGCAT AGTATACGGC ATAGTATAAT ACGACTCACT ATAGGAGGGC CACCATGGCC 3 5 40 

AAGTTGACCA GTGCCGTTCC GGTGCTCACC GCGCGCGACG TCGCCGGAGC GGTCGAGTTC 3 600 

TGGACCGACC GGCTCGGGTT CTCCCGGGAC TTCGTGGAGG ACGACTTCGC CGGTGTGGTC 3 6 60 

CGGGACGACG TGACCCTGTT CATCAGCGCG GTCCAGGACC AGGTGGTGCC GGACAACACC 3720 

CTGGCCTGGG TGTGGGTGCG CGGCCTGGAC GAGCTGTACG CCGAGTGGTC GGAGGTCGTG 3780 

TCCACGAACT TCCGGGACGC CTCCGGGCCG GCCATGACCG AGATCGGCGA GCAGCCGTGG 3840 

GGGCGGGAGT TCGCCCTGCG CGACCCGGCC GGCAACTGCG TGCACTTCGT GGCCGAGGAG 3 900 

CAGGACTGAN NNNCGGACCG GTCGA 3 925 



BNSDOCID; <WO_9708330A1_L> 



Il^»NATIONAL SEARCH REPORT 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/86 C12N5/10 C12N15/67 



According to Intenialional Patent ClaMificaaon (IPC) or to both national classincation and IPC 



_J_PC 



item aJ Application No 

PCT/GB 96/02061 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by dawificadon symbols) 

IPC 6 C12N 



Documentation searched other than minimum documentation to Che extent thai such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and. where practical, search terms used)" 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document, with indication, where appropriate, of the relevant passages 



JOURNAL OF VIROLOGY 69 (7). 1995. 
4086-4094. ISSN: 0O22-538X, 
July 1995, XP002O23654 
LUUKKONEN B G M ET AL: "Efficiency of 
reinitiation of translation on human 
inmunodeficiency virus type 1 mRNAs is 
determined by the length of the upstream 
open reading frame and by intercistronic 
distance. " 

see the whole document 

VIROLOGY (1995), 208(1), 215-25 CODEN: 
VIRLAX;ISSN: 0042-6822, 

1 April 1995, XPO02O23555 
HERZOG, ETIENNE ET AL: "Translation of 
the second gene of peanut clump virus RNA 
2 occurs by leaky scanning in vitro" 
see the whole document 



X] Further documents are listed in the continuation of box C. 



Relevant to claim No. 



1-29 



1-29 



0 



Patent family members are listed in annex. 



Special categories of cited documents : 

'A' doctiment defining the general state of the art which is not 
considered to be of particular relevance 

'E' earlier document but published on or after the international 
filing date 

document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

document referring to an oral disclosure, use, exhibition or 
other means 

"P* document published prior to the international filing date but 
later than the priority' date claimed 



Date of the acctial completion of the international search 



23 January 1997 



'T" later document published after the international filing date 
or priority date and not m conflict with the application hut 
cited to understand the principle or theory underlyine the 
invention 

'X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
doctiment is combined with one or more other su<^ docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. ( + 3U70) 340-2040, Tx. 31 6S1 epo nl, 
Fax:(+3U70) 340-3016 



Form Ptrr/IS A/310 <cceond shest) <July 1993) 



Date of mading of the international search report 

1 2. 02. 37 



Authorized officer 



Hornig, H 



page 1 of 3 



BNSDOCID; <WO 9708330A 1J_> 



ATIONAL SEARCH REPORT 



^ Application No 

PCT/GB 96/02061 



C.(ContinuaQon) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category " CiUtion of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



J. VIROL. (1993), 57(8). 4886-95 CODEN: 
JOVIAM;ISSN: 0G22-538X, 
August 1993, XP000616337 
FOUILLOT, NATHALIE ET AL: "Translation of 
the hepatitis B virus P gene by ribosomal 
scanning as an alternative to internal 
initiation " 

see the whole document 
VIROLOGY, 

vol. 188, no. 1, May 1992, ACADEMIC PRESS, 

INC., NEW YORK, US, 

pages 342-352, XP002023656 

C.-G. LIN AND S.J. LO: "Evidence for 

involvement of a ribosomal leaky scanning 

mechanism in the translation of the 

hepatitis B virus Pol gene from the viral 

pregenome RNA" 

see the whole document 

VIROLOGY, 

vol. 185, no. 2, December 1991, ACADEMIC 
PRESS, INC., NEW YORK. US, 
pages 862-866. XP000616129 
F.-L. COSSET ET AL. : "Newcastle disease 
virus (NDV) vaccine based on immunization 
with avian cells expressing the NDV 
hemagg 1 u 1 1 n i n -neurami n 1 da se gl ycoprote i n " 
cited in the application 
see the whole document 

MOL. CELL. BIOL., 

vol. 7, no. 10, October 1987, ASM 

WASHINGTON, DC, US. 

pages 3438-3445, XPOO0516288 

M. KOZAK: "Effects of interci stronic 

length on the efficiency of reinitiation 

by eucaryotic ribosomes" 

cited in the application 

see the whole document 

EXPRIMENTAL CELL RESEARCH, 

vol. 197, no. 2. December 1991, ACADEMIC 

PRESS INC., NEW YORK. US, 

pages 229-233. XPG0O616323 

M. IZUMI ET AL.: "Blasticidin 

S-resi stance gene (bsr): a novel 

selectable marker for mammalian cells" 

cited in the application 

see the whole document 



■/-- 



1-29 



1-29 



1-29 



1-29 



1-29 



Form PCT/1SA.-2I0 (oontinuation of teoMid chest) {July 1993) 

page 2 of 3 

BNSCXXID: ■:WO_970B330A1J_> 



NATIONAL SEARCH REPORT 



vrnten 



Application No 

PCT/bB 96/02G61 



C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of document, with indication, where appropriate, of the i-elevant passages 



Relevant to claim No: 



P.x 



NATURE (LONDON) (1984), 309(5963). 82-5 
CODEN: NATUAS;ISSN: 0028-0836, 
3 May 1984, XP002023557 
LIU, CHUNG CHENG ET AL: "Initiation of 
translation at internal AUG codons in 
mamnalian cells" 
see the whole document 

WO, A, 94 24870 (BIOTRANSPLANT INC ;GEN 
HOSPITAL CORP (US); LE 6UERN CHRISTIAN A 
(U) 10 November 1994 
see the whole document 

WO, A, 93 03143 (ANDERSON W FRENCH ;MORGAN 
RICHARD A (US); COUTURE LARRY (US)) 18 
February 1993 
see the whole document 

WO, A, 94 23048 (US HEALTH ;E1DEN MARYBETH V 
(US); WILSON CAROLYN A (US); DEACON NI) 13 
October 1994 
see the whole document 

PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES OF USA, 

vol. 85, no. 17, 1 September 1988, 

pages 6460-6464, XP0OO569693 

DANOS 0 ET AL: "SAFE AND EFFICIENT 

GENERATION OF RECOMBINANT RETROVIRUSES 

WITH AMPHOTROPIC AND ECOTROPIC HOST 

RANGES" 

see the whole document 

J. VIROL. ( 1995 ), 69(12), 7430-6 CODEN: 
JOVIAM;ISSN: 0O22-538X, 
December 1995, XPO00569527 
COSSET. FRANCOIS-LOIC ET AL: "High-titer 
packaging cells producing recombinant 
retroviruses resistant to human serum" 
see the whole document 



1-29 



1-29 



1-29 



1-29 



1-29 



1-29 



Forai PCT.1SA,/2I0 (Gontinuaiion of secood sheet) (July 1992) 

BNSDCX:iD: <WO 9708330A1_I_> 



page 3 of 3 



IN1MNATIONAL SEARCH REPORT 

..ormation on patent family members 




al Application No 

PCT/GB 96/02061 



Patent document 
cited in search report 



Publication 

date 



Patent family 
member(s) 



Publication 

date 



WO-A-9424870 


10-11-94 


AU-A- 
CA-A- 
... EP-A- 


6230394 
2162056 
0706319 


21-11-94 
10-11-94 
17-04-96 


WO-A-9303143 


18-02-93 


CA-A- 
EP-A- 
JP-T- 


2114416 
0598029 
6509713 


18-02-93 
25-05-94 
02-11-94 


WO-A-9423048 


13-10-94 


AU-A- 
CA-A- 
EP-A- 
JP-T- 


6903194 
2160034 
0699240 
8511156 


24-10-94 
13-10-94 
06-03-96 
26-11-96 



Fom PCT;iSA.-*310 (patent family annex) (July 1992) 

BNSDOCID: <WO_9708330A1_L> 



# 



